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GENE EXPRESSION SYSTEM 

BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001] The present invention relates to compositions and methods for the 

production of polypeptides and/or untranslated RNA molecules in host cells. 
The invention provides nucleic acid molecules, expression systems, host cells, 
methods and kits that are useful for the production of polypeptides and/or 
untranslated RNA molecules. In a preferred embodiment, the invention allows 
regulated and/or inducible expression of a nucleotide sequence of interest in a 
host cell. 

Related Art 

[0002] Process development for biopharmaceuticals is influenced by product 

quality and economy of the manufacturing process. The economic production 
of recombinant proteins in mammalian cells is dependent on the selection of 
the producing cell lines. The classical approach is the use of stable expression 
systems. These systems are based on chromosomal integration of an 
expression plasmid into the genome of the host cell. 

[0003] An alternative to stable expression systems is transient gene 

expression. For example, transient gene expression in, e.g., mammalian cells 
at reactor scale is becoming increasingly important for the rapid production of 
recombinant proteins. Large scale applications of transient expression systems 
for recombinant protein production have been reported using COS, HEK 293 
and BHK21 cells (Blasey et al., 1996, Cytotechnology, 18, 138-192; Jordan et 
al., 1998, Cytotechnology, 26, 39-47; Wurm and Bernard, 1999, Curr Opin 
Biotechn 10, 156-159; Meissner et al., 2001, Biotechn andBioeng, 75, 2, 197- 
203). 

[0004] Transient expression can be performed by infection (using, e.g., viral 

vectors) or by transfection of the appropriate vector. Transfection can be 



performed with either a non-replicating or a replicating DNA vector or with a 
virus derived RNA vector. 
[0005] A variety of virus vectors for expression in higher eukaryotic cells 

have been developed by using recombinant DNA technology and introducing 
genes of interest into virus genomes. The initial viruses used in vector 
development were DNA viruses such as Adeno-associated viruses or 
Poxviruses as well as RNA viruses replicating through DNA intermediates 
such as Retroviruses. 

[0006] Episomal replicating vectors provide certain advantages over classical 

expression systems. Some DNA viruses, such as Adeno-, Papilloma-, 
Polyoma-, Hepadna-, and Herpesviruses, do not normally integrate into the 
host genome, but replicate episomally (extrachromosomally) in the nucleus of 
a host cells such as in a mammalian host cell. This process may involve both 
virus trans-acting factors and the host cell replication machinery. Episomal 
replicating vectors derived from these viruses generally contain a replication 
origin and at least one viral protein, e.g., an initiator protein. Examples of such 
initiator proteins are large T-antigen for SV40, E1/E2 for BPV, and EBNA-1 
for EBV. 

[0007] Some host factors are able to recognize viral origins of replication and 

initiate replication. For example human S/MAR (scaffold/matrix attached 
region from the human interferon-beta gene) is able to maintain the episomal 
replication of a vector containing SV 40 origin of replication. On the other 
hand certain viral origins of replication (ORI) are recognized by corresponding 
specific virally-encoded proteins. For example, SV40 or polyoma virus ORI 
is recognized by large T antigen, BPV (Bovine papilloma virus) ORI is 
recognized by E1/E2 complex, EBV (Epstein-Barr virus) ORI is recognized 
by EBNA1 protein, and yeast origin of replication (ARS element) is 
recognized by ORE (origin recognition element) replication (Bode et al., 2001 
Gene Ther Mol Biol, v6, 33-46). 

[0008] The double strand DNA genome of Epstein-Barr virus EBV (1 65 kb) is 

maintained in the latently infected cells as a large circular episome. 
Amplification of this episome is mediated by a cis-acting sequence such as the 



origin of replication (OriP), and one of the viral proteins, the Epstein-Barr 
viral nuclear antigen 1 (EBNA1). In a minimal system the presence of both 
OriP (1.8 kb region) and EBNA-1 is sufficient for replication and stabilization 
of recombinant plasmids. The exact function of EBNA-1 is- still not clear, 
although the protein has a DNA binding domain. Generally EBNA-1 is 
involved in plasmid maintenance after completion of the synthesis and in 
plasmid segregation by dividing cells during proliferation. The OriP region is 
recognized also directly by me cellular DNA synthesizing machinery and is 
replicated during the cell cycle. 

[0009] Other gene expression systems are based on alphaviruses (Lundstrom, 

K., Curr. Opin. Biotechnol. 5:578-582 (1997)). Several members of the 
alphavirus family, e.g., Sindbis virus (Xiong, C. etal, Science 243:1 188-1191 
(1989); Schlesinger, S., Trends Biotechnol. 77:18-22 (1993)), SFV 
(LiljestrSm, P. & Garoff, H., Bio/Technology 9:1356-1361 (1991)) and others 
(Davis, N.L. et al., Virology 777:189-204 (1989)) have received considerable 
attention for the use as virus-based expression vectors for a variety of different 
proteins (Lundstrom, K., Curr. Opin. Biotechnol. 5:578-582 (1997); 
Liljestrom, P., Curr. Opin. Biotechnol. 5:495-500(1994)). 

[0010] Alphaviruses are positive stranded RNA viruses which replicate their 

genomic RNA entirely in the cytoplasm of the infected cell and without a 
DNA intermediate (Strauss, J. and Strauss, E., Microbiol. Rev. 55:491-562 
(1994)). The concept that alphaviruses can be developed as expression vectors 
was first established nearly ten years ago (Xiong, C. et al, Science 243:1 188- 
1191 (1989)). Since then, several improvements have made the use of these 
RNA replicons as expression vectors more practical (Lundstrom, K., Curr. 
Opin. Biotechnol. 5:578-582 (1997)). 

I0011J DNA vectors have been developed for both Sindbis virus (Herweijer, 

H. et al., Hum. Gene Ther. (5:1495-1501 (1995); Dubensky, T.W. et al., J. 
Virol. 70:508-519 (1996)) and SFV (Berglund, P. et al., Trends Biotechnol. 
74:130-134 (1996)). Eukaryotic promoters are introduced in these vectors 
upstream from the alphavirus replicase gene (consisting of the four non- 
structural protein genes (nsPl-4)), which are translated as one or two 
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polyproteins which are then proteolytically cleaved. DNA is transcribed into 
RNA from the recombinant eukaryotic promoter in the nucleus and 
transported to the cytoplasm, where the replicase catalyzes the replication of 
the alphavirus RNA molecule as during normal replication of the alphavirus 
RNA molecule (Strauss, J. and Strauss, E., Microbiol Rev. 55:491-562 
(1994)). Only transient expression of heterologous sequences has been 
possible until recently due to the cytopathogenicity of the alphavirus replicase 
(Lundstrom, K., Curr. Opin. Biotechnol 5:578-582 (1997)). 

[0012] Among the features which make Alphavirus vectors attractive as 

transient expression systems are the easy construction of plasmid clones, a 
rapid gene expression (within 24h), a high level of expression, a wide host 
range of the virus (many insect, avian and mammalian cells), production of 
high-titered stocks of infectious particles and low risk of cell transformation 
due to the RNA nature of the genome and its replication entirely in the host 
cell cytoplasm without going through DNA intermediates. (Schlesinger S., 
2000, Exp. Opin. Biol Then, 12, 177-191; Schlesinger S., Adv. Virus res., 
v55 t 565-577; Garoff, H. and KJ. Li, 1998, Curr. Opin. Biotechnol, 9, 464- 
469; Lundstrom, K., 1997, Curr. Opin. Biotechnol, 8. 578-583; Xiong et aL, 
1989, Science, 243, 1188-1191). Blasey et aL compared the Semliki Forest 
Virus (SFV) expression system with the Epstein-Barr Virus (EBV) system 
(pCEP4 and HEK293EBNA cells) for expression of 5-HT3 receptors and 
found that both systems allow the production of functional recombinant 
protein within days to weeks at very high comparable levels (in: Animal Cell 
Technology, 1999, pp33 1-337). 

[0013] One of the variants of alphavirus vectors is the replicon, which is 

capable of self-replication in the host cytoplasm after introduction into the 
cultured cells by transfection. In this replicon, viial structural protein genes 
have been replaced by a gene of interest (GOI). Such a replicon can be 
introduced into cells as RNA or cDNA if placed under the control of an 
eukaryotic promoter or a promoter, which is recognized by the host cell 
machinery. (Agapov et al, Proa Natl Acad Set USA 95:12989-12994 
(1998)). 
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[00141 RNA viruses have been engineered into chimeric viruses producing 

heterologous proteins. Such RNA virus vectors have been developed from 
Alphaviruses (with positive strand genome), Influenza and VSV (with 
negative strand genome). One of the major advantages of these viruses is their 
RNA-only replication strategy performed entirely in the cell cytoplasm and 
eliminating the risk of genome integration. 
[0015] Adeno-, Adeno-associated and Retrovirus vectors can be used as gene 

therapy vectors and/or as expression vectors, especially for large-scale protein 
production. Herpesvirus derivative vectors containing a replication origin have 
been used for bioreactor protein production after transient transfection of a 
suitable cell line carrying a replication initiation gene. (Durocher et al., 
Nucleic Acids Res. 30(2):E9 (2002); Meissner et al., Biotechnol. Bioeng. 
t 75(2): 197-203 (2001)). In these experiments, 293E cells (293 human embryo 
kidney cells expressing Epstein-Barr vims nuclear antigene - EBNA-1) were 
used which provided a replication initiation factor in trans to a plasmid 
carrying EBV origin of replication - OriP. Reporter genes (encoding secreted, 
intracellular or transmembrane proteins) were constitutively expressed over a 
period of several days. 
[0016] Hybrid vectors have also been reported, mainly as gene delivery 

systems or for use in virus biology research, such as combinations between 
EBV episomes and Herpes-, Adeno- (DNA) or Retro- (RNA-DNA) viruses 
(Sena-Esteves M. et al, 1999, Virol. Dec;7 3(12) .10426-39; Tan BT. et 
al.,1999, J Virol. Sep;73(9):7582-9; Grignani F. et al., 1998, Cancer Res Jan 
1; 58(1): 14-9; Wang S. and Vos JM., 1996, J Virol. Dec;70(12):8422-30). 
EBV episomes are nuclear plasmids that are stably maintained through 
multiple cell divisions in primate and canine cells (J. L. Yates, N. Warren, and 
B. Sugden, 1985, Nature 313:812-815). 
[0017] The ability to precisely control the expression of genes introduced into 

animal or human cells, or in whole organisms, enables significant progress in 
many areas of biology and medicine. For instance, methods that allow the 
intentional manipulation of gene expression will facilitate the analysis of 
genes whose expression cannot be tolerated constitutively. 



[0018] To be of broad benefit, gene regulation techniques preferably allow for 

rapid, robust, precise and reversible induction of gene activity. As reviewed in 
Saez, E. et al, (Curr. Opirt Biotechnol 5:608-616 (1997)), an ideal system 
would fulfill the following requirements: 

[0019] 1. Specificity - The system should be indifferent to endogenous 

factors and activated only by exogenous stimuli. 

[0020] 2. Non-interference - The components of the system should not 

affect unintended cellular pathways. 

[0021] 3. Inducibility - In the inactive state, the basal activity of the 

system should be minimal, while in the active state high levels of gene 
expression should be rapidly inducible. 

[0022] 4. Bioavailability of the inducer — Inducing stimuli should rapidly 

penetrate to the site of interest. 

[0023] 5. Reversibility — Inducing stimuli should clear swiftly to allow 

the system to rapidly return to the inactive state. 

[0024] One common system currently in use for the regulation of gene 

expression is the tetracycline-based system (Gossen and Bujard, Proc. Natl 
Acad. Set USA 89:5547 (1992)). This system is based on the continuous 
expression of a fusion protein with the tetracycline repressor protein (tetR) 
being converted into an activator by fusion to the transcriptional activation 
domain of the VP16 protein. The tTA system is useful for, e.g., inducible gene 
expression and has been successfully used for the production of a number of 
proteins (Wimmel et al. t Oncogene 9:995 (1994); Fruh et al, EMBO J. 
73:3236 (1994); Yu et al., J. Virol 70:4530 (1996)). However, there are 
certain disadvantages associated with the tTA system. (Schocket et al, Proc. 
Natl Acad Set USA 92:6522 (1995); Howe et al, J. Biol Chem. 25:14168 
(1995); Schocket and Schatz, Proc. Nail Acad Sci. USA 93:5X13 (1996); 
Bohl et al, Nat. Med 3:299 (1997); Furth et al, Proc. Natl Acad Sci. USA 
91:9302 (1994); Hennighausen et al, J. Cell Biochem. 59:463 (1995); Kistner 
et al, Proc. Natl Acad Sci. USA 93:10933 (1996); Hoffmann et al, Nucleic 
Acids Res. 25:1078 (1997)). 
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[0025] Inducible expression systems based on viral expression vectors (e.g., 

aphaviral vectors) are described in WO 99/50432 (incorporated herein by 
reference in its entirety). An exemplary alphaviral vector is pCytTs (also 
known as CytTs; cf. WO 99/50432). Using these systems, it is possible to 
precisely induce and regulate gene expression. This precise regulation results 
from the use of a temperature-sensitive RNA-dependent RNA polymerase 
(i.e., a replicase), which only replicates RNA molecules, to form new RNA 
molecules, at permissive temperatures (Boorsma et al. Nature Biotech. 18:429- 
432 (2001)). Due to the extremely tight regulation of these systems it is 
possible to obtain regulated gene expression even under conditions of transient 
transfection allowing the production of toxic proteins under these conditions. 

[0026] Another example of an alphaviral expression vector is described in 

Lundstrom etal, FEBS Lett 504:99-103 (2001). Lundstrom describe a rapidly 
generated high-titer Semliki Forest virus (SFV) vectors capable of infecting 
numerous mammalian cell lines and primary cell cultures, resulting in high 
levels of transgene expression. SFV-based expression of transmembrane 
receptors has been characterized by specific ligand-binding activity and 
functional responses. Adaptation of the SFV technology for mammalian 
suspension cultures has allowed the production of hundreds of milligrams of 
recombinant receptor for purification and structural studies. The same SFV 
stock solutions used for the infection of mammalian cells in culture have also 
been successfully applied for efficient transgene expression in organotypic 
hippocampal slices, as well as in vivo in rodent brain. 

[0027] A need exists in the art, however, for compositions and methods that 

allow the production of polypeptides and untranslated RNA molecules. 

BRIEF SUMMARY OF THE INVENTION 

[0028] The present invention provides compositions and methods that allow 

the production of polypeptides and/or untranslated RNA molecules. More 
specifically, the invention provides nucleic acid molecules, expression 
systems, recombinant host cells that permit the production of polypeptides 
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and/or untranslated RNA molecules. Also provided are methods for making 
the nucleic acid molecules, expression systems and host cells of the invention. 
The invention further provides methods for producing polypeptides and/or 
untranslated RNA molecules. The invention also provides kits that comprise 
the nucleic acid molecules, expression systems and/or recombinant host cells 
of the invention. 

[0029] In certain embodiments, the invention provides compositions and 

methods for regulated, expression of polypeptides or untranslated RNA 
molecules in recombinant host cells, preferably by way of transfection. The 
present invention also provides nucleic acid molecules and methods which 
allow, e.g., rapid and high level production of specific RNA molecules 
produced in, e.g., transiently transfected, recombinant host cells. 

[0030] According to certain aspects of the invention, novel, preferably 

inducible, potent and rapid expression systems are provided comprising one or 
more nucleic acid molecules. The nucleic acid molecules of the expression 
systems of the invention may be maintained at, or accumulated to, multiple 
copies in the nuclei of recombinant host cells, preferably of transiently 
transfected mammalian cells. In certain embodiments, RNA self replication 
may lead to the accumulation of a high number of RNA molecules in the 
cytoplasm of the transfected cells and subsequent translation of the 
polypeptide of interest The nucleic acid molecules and expression systems of 
the invention are particularly useful for transient transfection. 

[00311 The present invention further provides nucleic acid molecules and 

methods which allow inducible rapid and high level production of specific 
RNA molecules produced in, preferably transiently transfected, recombinant 
host cells. In certain embodiments, RNA self-replication is preferably 
inducible and triggered by a temperature shift. Thus, the high production 
levels, in certain embodiments, are achieved by the use of a temperature- 
sensitive RNA-dependent RNA polymerase (/.<?., a replicase) which only 
replicates RNA molecules to form new RNA molecules at permissive 
temperatures. Moreover, if desired, the amount of RNA or polypeptide can be 
tightly regulated by the use of a temperature-sensitive RNA-dependent RNA 
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polymerase. This facilitates, in particular, production of growth-inhibitory or 
toxic polypeptides or RNA-species. 
10032] In a general aspect, the present invention provides for a nucleic acid 

molecule comprising (A) a first polynucleotide element which encodes an 
RNA molecule, said RNA molecule comprising (a) at least one cfc-acting 
sequence element; (b) a first nucleotide sequence comprising a first open 
reading frame, said first open reading frame having a nucleotide sequence 
encoding an RNA-dependent RNA polymerase; and (c) at least one second 
nucleotide sequence selected from the group consisting of (i) a second open 
reading frame encoding a polypeptide; (ii) a nucleotide sequence 
complementary to all or part of the second open reading frame of (i); and (iii) 
a nucleotide sequence encoding an untranslated RNA molecule or complement 
thereof; wherein said second nucleotide sequence is operably linked to a 
promoter which is recognized by said RNA-dependent RNA polymerase; (B) a 
second DNA element comprising an origin of replication; and (C) a third DNA 
element encoding a replication initiation factor capable of recognizing the 
origin of replication. Thus, the replication initiation factor preferably 
recognizes the origin of replication leading to replication of the second 
polynucleotide and second DNA element, respectively, comprising the origin 
of replication. 

[0033J In certain embodiments, the RNA-dependent RNA polymerase is, e.g., 

a non-cytopathic RNA-dependent RNA polymerase. In other embodiments, 
the RNA-dependent RNA polymerase is, e.g., a temperature-sensitive RNA- 
dependent RNA polymerase. In yet other embodiments, the RNA-dependent 
RNA polymerase is, e.g., a non-cytopathic, temperature-sensitive RNA- 
dependent RNA polymerase. 

[0034] The nucleic acid molecules of the invention, in certain embodiments, 

comprise a 5' promoter which is capable of initiating transcription in vivo, 5' 
and/or 3' sequences enabling replication of the RNA molecule (exacting 
sequence elements), and a subgenomic promoter 5' to the gene of interest, as 
well as a sequence of interest which is translatable only after one or more 
RNA-dependent RNA replication events. These RNA-dependent RNA 
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replication events are catalyzed by a, preferably regulatable, RNA-dependent 
RNA polymerase which may be encoded by the same mRNA molecule that is 
produced by transcription of the DNA vector or by a different mRNA 
molecule. 

[0035] The inventive nucleic acid molecules, expression systems and vector 

systems of the present invention thus allow for large-scale transient 
transfection and very rapid production of polypeptides eliminating the need of 
isolating stably transformed high producing cell clones. The preferred 
embodiment and hereby the inducibility of the vector makes the invention 
especially suitable for the production of cytotoxic polypeptides, e.g. y 
polypeptides that are detrimental to the viability of a host cell. 

[0036] In a preferred embodiment of the present invention, the origin of 

replication and the replication initiation factor are derived from a DNA virus. 
For example, the origin of replication may be derived from a herpes virus and 
the replication initiation factor may be the Epstein-Barr virus nuclear antigen 1 
(EBNA1). However, replication initiation factors may also derive from other 
viruses or be cellular factors, which are capable of recognizing or recognize 
viral origins of replication. As indicated, in a further embodiment, the RNA- 
dependent RNA-polymerase is temperature-sensitive and temperature- 
regulated, respectively. Preferably, the RNA-dependent RNA-polymerase is of 
viral, more preferably of alphaviral origin. 

[0037] Thus, in a preferred embodiment, the present invention provides a 

hybrid vector and vector system combining the advantages of EBV episomal 
replication with the advantages of RNA virus replication, wherein the 
replication takes place entirely in the cytoplasm. This type of hybrid vector 
would be characterized by the maintenance of multiple copies of DNA due to 
polynucleotide replication, typically plasmid replication, followed by the 
accumulation of full-length transcripts containing the viral replicon. These 
viral full-length transcripts are then further exponentially amplified in the 
cytoplasm by the viral replicase leading to the accumulation of RNA virus 
replicons and protein synthesis. The system is extremely valuable for 



-11- 

production of toxic polypeptides if the RNA virus replication is controllable or 
inducible. 

[0038] In a specific embodiment, the invention includes the combination of a 

Herpesvirus mini-replicon unit, i.e. the cis-acting replication origin OriP 
(Origin of replication P) and the cis- or trans-acting gene product - EBNA-1 
(Epstein-Barr virus nuclear antigen 1, a replication triggering factor) with the 
tightly regulated temperature inducible alphaviral expression system pCytTS, 
as disclosed in WO 99/504332. Due to the presence of OriP and EBNA-1 the 
introduced novel vector is maintained episomally (extrachromosomally) in the 
form of several DNA copies in the cell nuclei. These multiple DNA copies 
may be transcribed from a promoter, typically a CMV or RSV promoter, into 
CytTS RNA-replicons, but they remain inactive unless cells are being shifted 
to a certain temperature. Only after temperature induction replicon replication 
takes place, followed by RNA accumulation and translation in the cell 
cytoplasm and production of the polypeptide of interest. 

[0039] In a further general aspect, the present invention provides a vector 

system comprising one or more nucleic acid molecules, wherein said one or 
more nucleic acid molecules comprise (A) a first polynucleotide which 
encodes an RNA molecule, said RNA molecule comprising (a) at least one 
cw-acting sequence element; (b) a first nucleotide sequence comprising a first 
open reading frame having a nucleotide sequence encoding an 
RNA-dependent RNA polymerase; and (c) at least one second nucleotide 
sequence selected from the group consisting of (i) a second open reading 
frame encoding a polypeptide; (ii) a sequence complementary to all or part of 
the second open reading frame of (i); and (iii) a sequence encoding an 
untranslated RNA molecule, or complement thereof; wherein said second 
nucleotide sequence is operably linked to a promoter which is recognized by 
said non-cytopathic RNA-dependent RNA polymerase; (B) a second 
polynucleotide comprising an origin of replication; and (C) a third 
polynucleotide encoding a replication initiation factor capable of recognizing 
the origin of replication. Thus, the replication initiation factor recognizes the 
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origin of replication leading to replication of the second polynucleotide and 
second DNA element, respectively, comprising the origin of replication. 

[0040] The invention further provides single- and multiple-vector systems for 

producing a polypeptide or untranslated RNA molecule. In a single-vector 
system, the first, second and third polynucleotide are present on the same 
nucleic acid molecule. In a multiple-vector system, the first, second and third 
polynucleotide elements are present on one or more separate nucleic acid 
molecules. In such a multiple vector system, at least one, preferably some or 
more preferably all vectors contain an origin of replication. Therefore, in 
preferred embodiments of the invention, the replication initiation factor and 
the origin of replication recognizing the replication initiation factor may be on 
the same nucleic acid molecule and plasmid, respectively, as it is the RNA- 
replicase or on a separate nucleic acid molecule and plasmid, respectively. 

[0041] When sequences encoding the first and second open reading frame are 

present either on the same nucleic acid molecule or in the same vector (i.e., in 
a single-vector system), a region will preferably be present 5 1 to the second 
open reading frame which inhibits translation of this open reading frame. 

[0042] If a temperature sensitive RNA-dependent RNA polymerase (e.g., a 

temperature-sensitive replicase) is included within the compositions and 
methods of the invention, the temperature-sensitive replicase may be "cold" or 
"hot" sensitive and thus will only efficiently catalyze RNA-dependent RNA 
replication at temperatures either above or below the restrictive temperature. 
In one preferred embodiment, the compositions and methods of the invention 
comprise, or involve the use of, an RNA-dependent RNA polymerase that has 
replicase activity at temperatures below 34°C which is at least five fold greater 
than the replicase activity exhibited at 34°C or above. Preferably, the 
replicase activity at temperatures below 34°C is at least ten fold greater than 
the replicase activity exhibited at 34°C or above. In certain embodiments, the 
replicase activity at temperatures below 34°C is at least one hundred fold 
greater than the replicase activity exhibited at 34°C or above. Temperature- 
sensitive RNA-dependent RNA polymerases included within the present 
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invention may, in certain embodiments, exhibit no (or undetectable) replicase 
activity at 34°C or above. 
[0043] In another aspect, the nucleic acid molecules and expression systems 

of the invention encode one or more cytokine, lymphokine, tumor necrosis 
factor, interferon, toxic protein, prodrug converting enzyme, or other 
polypeptide. 

[0044] In yet another aspect, the nucleic acid molecules and expression 

systems of the invention encode an untranslated RNA molecule, such as an 
antisense RNA molecule, tRNA molecule, rRNA molecule, or ribozyme. 

[0045] The invention also provides methods for making recombinant host 

cells comprising introducing nucleic acid molecules of the invention into host 
cells. Further provided are recombinant host cells produced by the 
introduction of nucleic acid molecules of the invention. In one embodiment, 
one, some or all of these recombinant host cells contain one or more nucleic 
acid molecules that comprise an RNA-dependent RNA polymerase and/or the 
replication initiation factor. The replication initiation factor, in certain 
embodiments, may be stably integrated into the genome of the one, some or all 
host cells. 

[0046] The invention further provides isolated nucleic acid molecules 

comprising polynucleotides which comprise the nucleotide sequence of SEQ 
ID NO: 10 (pCytTs OriP) and SEQ ID NO: 1 1 (pCytTs OPE). 

[0047] The present invention also provides methods for producing 

polypeptides and untranslated RNA molecules in recombinant host cells, said 
methods comprising introducing a nucleic acid molecule of the invention into 
a host cell to produce a recombinant host cell, and culturing the recombinant 
host cell under conditions suitable for expression of said polypeptide or 
untranslated RNA molecule. The methods of the invention may further 
comprise recovering said polypeptide or untranslated RNA molecule. 

[0048] Methods are also provided for the expression of heterologous 

polypeptides, preferably the regulated expression of heterologous 
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polypeptides, including cytokines, lymphokines, tumor necrosis factors, 
interferons, toxic polypeptides, and prodrug converting enzymes. 

[0049] Further provided are polypeptides and untranslated RNA molecules 

produced by the methods of the invention. 

[0050] The invention also provides methods for regulating the expression of a 

polypeptide or an untranslated RNA molecule, said method comprising 
introducing nucleic acid molecules of the invention into a host cell to produce 
a recombinant host cell and growing said recombinant host cell under suitable 
culture conditions. In certain embodiments, nucleic acid molecules encoding a 
temperature-sensitive RNA-dependent RNA-polymerase are used. When a 
temperature-sensitive RNA-dependent RNA-polymerase is used, the methods 
according to this aspect of the invention may further comprise changing the 
temperature of the recombinant host cell culture from either (i) a permissive 
temperature to a restrictive temperature, or (ii) a restrictive temperature to a 
permissive temperature. 

[0051] In certain embodiments, the methods of the invention involve 

introducing the nucleic acid molecules of the invention into prokaryotic or 
eukaryotic host cells to produce a recombinant host cell, and then culturing 
said recombinant host cell under suitable culture conditions. The recombinant 
host cells may be cultured, e.g., in a serum-free or protein-free medium. 

[0052] The present invention also provides pharmaceutical' compositions 

comprising nucleic acid molecules of the invention and a pharmaceutical^ 
acceptable carrier. 

[0053] Also included within the invention are kits comprising the nucleic acid 

molecules of the invention. Kits of the invention may additionally or 
alternatively comprise one or more expression systems of the invention and 
one or more recombinant host cells of the invention. 

BRIEF DESCRIPTION OF THE DRAWING 

[0054] Fig. 1 is a schematic representation of the pCYTts vector. The pCYTts 

vector contains an ampicillin resistance marker for selection in bacterial cells 
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and a ColEl sequence which directs high copy number bacterial amplification. 
The pCYTts vector was prepared as described in Example 1. 
[0055] Figs. 2A-2D show the complete cDNA sequence of pCYTts (SEQ ID 

NO:l). 

|0056] Fig. 3 demonstrates CytTs-EGFP-OriPd derived expression in 293- 

EBNA cells. Selected cell population of 293-EBNA cells, which had been 
transfected with CytTs-EGFP-OriPd and selected with puromycin were either 
kept at 37°C or shifted to 29°C for 24 hours in order to induce the Sindbis 
replicon. Cells were trypsinised and analyzed by flow cytometry. 
Untransfected and CytTs-EGFP-OriPd transfected 293-EBNA cells either 
cultivated at 37°C or at 29°C, as indicated above the respective panels were 
analyzed. GFP expression is shown on the X-axis (FL1-H) the Y-axis shows 
the FL4-H fluorescene. 

[0057] Fig. 4 is a FACS analysis of BHK cells transfected with CytTs-EGFP, 

CytTs-EGFP-OriP and CytTs-EGFP-OPE. BHK cells were transfected with 
CytTs EGFP, CytTs EGFP-OriP or CytTs EGFP-OPE and selected with 
puromycin at a concentration 5ug/ml. Two weeks after transfection cell 
populations were assayed for EGFP expression. Non-induced cells (37°C) and 
cells induced for 3 days at 29°C were trypsinized and analyzed by flow 
cytometry. GFP expression is shown on the X-axis (FL1-H) the Y-axis shows 
the FL4-H fluorescene. 

[0058] Figs. 5A-5B demonstrate IFNp expression in BHK cells transfected 

with CytTs-IFNp-OPE. BHK cells were transfected with CytTs-IFNp-OPE 
and selected with puromycin as described in the above. BHK-CytTs-IFNp- 
OPE cell populations were then either kept at 37°C or shifted to 29°C for 
induction. Samples were taken from uninduced cells and from the induced 
cells 3, 4, 7 and 9 days after induction and quantitated by ELISA. The 
interferon concentration as international units (IU)/ml at the different time 
points is shown in panel A. 0 corresponds to uninduced control cells. The fold 
induction observed is shown in panel B. 
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[0059] Fig. 6 shows SEAP expression in BHK cells transfected with CytTs- 

SEAP-OPE. BHK cells were transfected with CytTs-SEAP-OPE and selected 
with puromycin as described above. BHK-CytTs-SEAP-OPE cell populations 
were then either kept at 37°C or shifted to 29°C for induction. Samples were 
taken from the uninduced cells and from the induced cells 3, 4, 7 and 9 days 
after induction and quantitated. 

[0060] Fig. 7 demonstrates the improved EPO expression from pCytTS-OPE- 

EPO compared to pCytTS-OriP-EPO in 293 EBNA1 cells. Selected cell 
populations which had been transfected either with pCytTS-OPE-EPO or with 
pCytTS-OriP-EPO were either kept at 37°C or shifted to 29°C in order to 
induce the Sindbis replicon. Samples were taken from the induced and 
uninduced cells 0, 4 and 8 days after induction and were quantified by ELISA. 
The EPO concentration at the different time points is shown in mg/1 on the Y- 
axis. 

[0061] Figs. 8A-8E show the nucleotide sequence of pCytTs OriP (SEQ ID 

NO:10) 

[0062] Figs. 9A-9E Show the nucleotide sequence of pCytTs OPE (SEQ ID 

NO: 11). 

DETAILED DESCRIPTION OF THE INVENTION 

OVERVIEW 

[0063] The present invention is directed to compositions and methods that are 

useful, e.g., in the production of polypeptides and/or untranslated RNA 
molecules in host cells. The compositions and methods of the invention are 
also useful, e.g., for the isolation and/or purification of polypeptides, 
polypeptides and untranslated RNA molecules that are produced in host cells. 

[0064] According to one aspect of the invention, nucleic acid molecules are 

provided. The nucleic acid molecules of the invention comprise: (A) a first 
polynucleotide element which encodes an RNA molecule, said RNA molecule 
comprising: (a) at least one ci>acting sequence element; (b) a first nucleotide 
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sequence comprising a first open reading frame, said first open reading frame 
having a nucleotide sequence encoding an RNA-dependent RNA polymerase; 
and (c) at least one second nucleotide sequence selected from the group 
consisting of: (i) a second open reading frame encoding a polypeptide; (ii) a 
nucleotide sequence complementary to all or a part of the second open reading 
frame of (i); and (iii) a nucleotide sequence encoding an untranslated RNA 
molecule or complement thereof; wherein said second nucleotide sequence is 
operably linked to a promoter which is recognized by said RNA-dependent 
RNA polymerase; (B) a second polynucleotide element comprising an origin 
of replication; and (C) a third polynucleotide element encoding a replication 
initiation factor capable of recognizing said origin of replication. 
[0065] According to another aspect of the invention, expression systems are 

provided. The expression systems of the invention comprise one or more 
nucleic acid molecules, wherein said one or more nucleic acid molecules 
comprise: (A) a first polynucleotide element which encodes an RNA 
molecule, said RNA molecule comprising: (a) at least one cw-acting sequence 
element; (b) a first nucleotide sequence comprising a first open reading frame, 
said first open reading frame having a nucleotide sequence encoding an RNA- 
dependent RNA polymerase; and (c) at least one second nucleotide sequence 
selected from the group consisting of: (i) a second open reading frame 
encoding a polypeptide; (ii) a nucleotide sequence complementary to all or a 
part of the open reading frame of (i); and (iii) a nucleotide sequence encoding 
an untranslated RNA molecule or complement thereof; wherein said second 
nucleotide sequence is operably linked to a promoter which is recognized by 
said RNA-dependent RNA polymerase; (B) a second polynucleotide element 
comprising an origin of replication; and (C) a third polynucleotide element 
encoding a replication initiation factor capable of recognizing said origin of 
replication. 

[0066] In the expression systems of the invention, said first, second and third 

polynucleotide elements may each be on a separate nucleic acid molecule. 
Alternatively, said first, second and third polynucleotide elements may each be 
on a single nucleic acid molecule. In certain embodiments, said first and 
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second polynucleotide elements will be on a single nucleic acid molecule 
while said third polynucleotide element is on a different nucleic acid molecule. 
In certain other embodiments, said first and third polynucleotide elements will 
be on a single nucleic acid molecule, while said second polynucleotide 
element is on a different nucleic acid molecule. In certain other embodiments, 
said second and third polynucleotide elements will be on a single nucleic acid 
molecule, while said first polynucleotide element is on a different nucleic acid 
molecule. 

[0067] According to another aspect of the invention, recombinant host cells 

and in vitro cell cultures comprising recombinant host cells are provided. The 
recombinant host cells of the invention comprise: (A) a first polynucleotide 
element which encodes an RNA molecule, said RNA molecule comprising: (a) 
at least one cw-acting sequence element; (b) a first nucleotide sequence 
comprising a first open reading frame, said first open reading frame having a 
nucleotide sequence encoding an RNA-dependent RNA polymerase; and (c) at 
least one second nucleotide sequence selected from the group consisting of: (i) 
a second open reading frame encoding a polypeptide; (ii) a nucleotide 
sequence complementary to all or a part of the second open reading frame of 
(i); and (iii) a nucleotide sequence encoding an untranslated RNA molecule or 
complement thereof; wherein said second nucleotide sequence is operably 
linked to a promoter which is recognized by said RNA-dependent RNA 
polymerase; (B) a second polynucleotide element comprising an origin of 
replication; and (C) a third polynucleotide element encoding a replication 
initiation factor capable of recognizing said origin of replication. 

[0068] According to another aspect of the invention, methods are provided for 

producing, or regulating the expression of, a polypeptide or untranslated RNA 
molecule. The methods according to this aspect of the invention comprise: (a) 
introducing a nucleic acid molecule or an expression system of the invention 
into a host cell to produce a recombinant host cell; and (b) culturing said 
recombinant host cell. The methods of the invention may further comprise 
recovering said polypeptide or untranslated RNA molecule. 
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[0069J According to another aspect of the invention, kits are provided 

comprising the nucleic acid molecules, expression systems and/or host cells of 
the invention. 

[0070] In certain embodiments, said second open reading frame that is 

included within the compositions and methods of the invention is in a 
translatable format after one or more RNA-dependent RNA replication events. 
The expression "a translatable format" is intended to mean an RNA from 
which a protein can be made. 

[00711 The RNA-dependent RNA polymerase that is included in, or used 

with, the compositions and methods of the invention may, in certain 
embodiments, be selected from the group consisting of: (a) a temperature- 
sensitive RNA-dependent RNA polymerase, (b) a non-cytopathic RNA- 
dependent RNA polymerase, and (c) a temperature-sensitive, non-cytopathic 
RNA-dependent RNA polymerase. 

[0072] When the RNA-dependent RNA polymerase that is included in, or 

used with, the compositions and methods of the invention is temperature 
sensitive, the temperature-sensitive RNA-dependent RNA polymerase may 
have replicase activity at temperatures below 34°C which is at least five fold, 
at least ten fold, at least twenty fold, at least thirty fold, at least forty fold, at 
least fifty fold, at least one hundred fold, or at least one thousand fold greater 
than the replicase activity exhibited at 34°C or above. For example, the 
temperature-sensitive RNA-dependent RNA polymerase may have replicase 
activity at 34°C that is at least five fold greater than the replicase activity 
exhibited at 29°C. The temperature-sensitive RNA-dependent RNA 
polymerase may, in certain embodiments, exhibit no or undetectable replicase 
activity (using methods of detection known in the art) at 34°C or above. 

[0073] The RNA-dependent RNA polymerase that is included in, or used 

with, the compositions and methods of the invention may, in certain 
embodiments, be of viral origin. For example, the RNA-dependent RNA 
polymerase may be of alphaviral origin. In specific embodiments of the 
invention, the RNA-dependent RNA polymerase is derived from a Sindbis 
virus, a Semliki Forest virus or an Aura virus. The RNA-dependent RNA 
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polymerase may be derived from a virus selected from the group consisting of 
Bebaru virus, Cabassou virus, Chikungunya virus, Easter equine 
encephalomyelitis virus, Fort Morgan virus, Getah virus, Kyzylagach virus, 
Mayoaro virus, Middleburg virus, Mucambo virus, Ndumu virus, Pixuna 
virus, Tonate virus, Triniti virus, Una virus, Western equine encephalomyelitis 
virus, Whataroa virus, Venezuelan equine encephalomyelitis virus (VEE), and 
Ross River virus. 

[0074] The origin of replication that is included in, or used with, the 

compositions and methods of the invention may be derived from a prokaryotic 
organism, a eukaryotic organism (e.g., a yeast, insect or mammal), or a virus. 
For example, the origin of replication may be derived from a DNA virus, e.g., 
a DNA virus that allows for episomal replication. The origin of replication 
may be derived from a DNA virus selected from the group consisting of 
Herpesvirus, Epstein-Barr virus (EBV), Papillomavirus, Polyomavirus, 
Adenovirus, and Hepadnavirus. In a specific embodiment, the origin of 
replication is oriP, derived from EBV. 

[0075] The replication initiation factor that is included in, or used with, the 

compositions and methods of the invention is, in certain embodiments, capable 
of operating as a plasmid maintenance factor. As used herein, the term 
"plasmid maintenance factor" is intended to mean a factor, which supports the 
distribution of the episomes to the daughter cells upon cell division. In the 
case of EBNA-1, this is achieved by the binding of the protein to the episome 
and, at the same time, to the chromosome, thereby leading to proper 
distribution of the episomes to the daughter cells. The replication initiation 
factor may be derived from a prokaryotic organism, a eukaryotic organism 
(e.g., a yeast, insect or mammal), or a virus. For example, the replication 
initiation factor may be derived from a DNA virus, e.g., a DNA virus that 
allows for episomal replication. The replication initiation factor may be 
derived from a DNA virus selected from the group consisting of Herpesvirus, 
Epstein-Barr virus (EBV), Papillomavirus, Polyomavirus, Adenovirus, and 
Hepadnavirus. In a specific embodiment, the replication initiation factor is the 
EBNA-1 protein, derived from EBV. 
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[0076] The origin of replication and the replication initiation factor that are 

included in, or used with, the compositions and methods of the invention may, 
in certain embodiments, be derived from the same organism or the same virus. 
Alternatively, the origin of replication and the replication initiation factor may 
be derived from different organisms or viruses. 

[0077] The nucleic acid molecules and expression systems of the invention 

may further comprise a fourth polynucleotide element, wherein said fourth 
polynucleotide element comprises a selection marker. Any selection marker 
known in the art may be used, including, e.g. , markers that confer resistance to 
antibiotics. An exemplary selection marker is one that confers resistance to 
puromycin. 

[0078] The second open reading frame that is included in, or used with, the 

compositions and methods of the invention may encode, in certain 
embodiments, any polypeptide of interest For example, the second open 
reading frame may encode, e.g., a cytokine, a lymphokine, a tumor necrosis 
factor, an interferon (e.g., beta-interferon, including human beta-interferon), a 
toxic polypeptide, or a prodrug. 

[0079] The second open reading frame that is included in, or used with, the 

compositions and methods of the invention may encode, in certain 
embodiments, an untranslated RNA molecule. Exemplary untranslated RNA 
molecules include, e.g., an antisense RNA molecule, a tRNA molecule, an 
rRNA molecule and a ribozyme. Other useful untranslated RNA molecules 
are known in the art. 

[0080] Thus, in certain exemplary embodiments, the invention provides 

compositions and methods which allow for the production of high amounts of 
specific RNA molecules produced in host cells upon their transient 
transfection with said polynucleotides. This high production results, e.g, from 
the use of a self-replicating episomal DNA plasmid combined with an RNA- 
dependent RNA polymerase, which amplifies the mRNA of interest. 

[0081] Expression, using the compositions and methods of the invention, may 

be regulated precisely, if necessary. Such precise regulation can be achieved, 
e.g., from the use of a temperature-sensitive RNA-dependent RNA polymerase 
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which only replicates RNA molecules to form additional RNA molecules at 
pennissive temperatures. 

[0082] According to the present invention, polypeptides and/or untranslated 

RNA molecules may be produced in transient transfection experiments due to 
the use of nucleic acid molecules comprising an origin of replication and DNA 
encoding a protein (a factor or a DNA-dependent replicase) that can recognize 
the origin of replication, thereby resulting in replication of the plasmid. 

[0083] The expression systems of the invention, in certain embodiments, 

comprise alphavirus DNA vectors that can be used to create transiently 
transfected cell lines. Said alphavirus DNA vectors may cany genes encoding 
a non-cytopathic replicase and viral non-structural proteins, thereby being able 
to produce high amounts of polypeptide. An induction of expression is 
achieved when the activity of the temperature-sensitive replicase is switched 
on by reducing the incubation temperature of the transfected cells from 37°C 
to a temperature lower than 34°C. Protein expression in the host at 37°C is, 
preferably, below the level of detection. 

DEFINITIONS 

[0084] The following definitions are provided to clarify the subject matter, 

which the inventors consider to be the present invention. 

[0085] As used herein, a nucleic acid molecule is a sequence of contiguous 

nucleotides (riboNTPs, dNTPs or ddNTPs, or combinations thereof) of any 
length which may encode a full-length polypeptide or a fragment of any length 
thereof, or which may be non-coding. As used herein, the terms "nucleic acid 
molecule," and "polynucleotide," "polynucleotide construct," and 
"polynucleotide element" may be used interchangeably. 

[0086] As used herein, the term "alphavirus" refers to any of the RNA viruses 

included within the genus Alphavirus. Descriptions of the members of this 
genus are contained in Strauss and Strauss, Microbiol Rev., 55:491-562 
(1994). Examples of alphaviruses are selected from the group comprising 
Aura virus, Bebaru virus, Cabassou virus, Chikungunya virus, Easter equine 
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encephalomyelitis virus, Fort morgan virus, Getah virus, Kyzylagach virus, 
Mayoaro virus, Middleburg virus, Mucambo virus, Ndumu virus, Pixuna 
virus, Tonate virus, Triniti virus, Una virus, Western equine encephalomyelitis 
virus, Whataroa virus, Sindbis virus (SIN), Semliki forest virus (SFV), 
Venezuelan equine encephalomyelitis virus (VEE), and Ross River virus. 

[0087] As used herein, the term "purified" used in reference to a molecule 

means that the concentration of the molecule being purified has been increased 
relative to molecules associated with it in its natural environment Naturally 
associated molecules include polypeptides, nucleic acids, lipids and sugars but 
generally do not include water, buffers, and reagents added to maintain the 
integrity or to facilitate the purification of the molecule. For example, even if 
mRNA is diluted with an aqueous solvent during oligo dT column 
chromatography, mRNA molecules are purified by this chromatography if 
naturally associated nucleic acids and other biological molecules do not bind 
to the column and are separated from the subject mRNA molecules. 

[0088] As used herein, the term "isolated" used in reference to a molecule 

means that the molecule has been removed from its native environment. For 
example, a polynucleotide or a polypeptide naturally present in a living animal 
is not "isolated," but the same polynucleotide or polypeptide separated from 
the coexisting materials of its natural state is "isolated." Further, recombinant 
DNA molecules contained in a vector are considered isolated for the purposes 
of the present invention. Isolated RNA molecules include in vivo or in vitro 
RNA replication products of DNA and RNA molecules. Isolated nucleic acid 
molecules further include synthetically produced molecules. Additionally, 
vector molecules contained in recombinant host cells are also isolated. Thus, 
not all "isolated" molecules need to be "purified." 

[0089] As used herein, the phrase "low or undetectable," when used in 

reference to gene expression level, refers to a level of expression which is 
either significantly lower than that seen when the gene is maximally induced 
(e.g., at least five fold lower) or is not readily detectable by the methods used 
in the following examples section. 
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[0090J As used herein, the phrase "individual" refers to multicellular 

organisms and includes both plants and animals. Preferred multicellular ' 
organisms are animals, more preferred are vertebrates, even more preferred are 
mammals, and most preferred are humans. 

[0091] As used herein, the phrase "cz>acting" sequence refers- to nucleic acid 

sequences to which a replicase binds to catalyze the RNA-dependent 
replication of RNA molecules. These replication events result in the 
^ replication of the ftdl-length and partial RNA molecules and, thus, the 
alphavirus subgenomic promoter is also a "cw-acting" sequence. Cis-acting 
sequences may be located at or near the 5' end, 3' end, or both ends of a 
nucleic acid molecule, as well as internally. 

[0092] As used herein, the phrase "RNA-dependent RNA polymerase" refers 

to a polymerase which catalyzes the production of an RNA molecule from 
another RNA molecule. This term is used herein synonymously with the term 
"replicase." 

[0093] As used herein, the phrase "non-infective packaged RNA molecules" 

refers to packaged RNA molecules which will essentially undergo only one 
round of host cell infection and are not pathogenic. These molecules are thus 
"infective" but only for a single infectious entry into a host cell and are not 
capable of reproducing to form additional infectious particles. 

[0094] As used herein, the term "transcription" refers to the production of 

RNA molecules from DNA templates catalyzed by RNA polymerases. 

[0095] As used herein, the phrase "RNA-dependent RNA replication event" 

refers to processes which result in the formation of an RNA molecule using an 
RNA molecule as a template. 

[0096] As used herein, the term "vector" refers to an a polynucleotide 

construct, typically a plasmid or a virus, used to transmit genetic material to a 
host cell. Preferably, the term "vector" as used herein refers to an agent such 
as a plasmid, and even more preferably to a circular plasmid, A vector as used 
herein may be composed of either DNA or RNA. Preferably, a vector as used 
herein is composed of DNA. 
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[0097] As used herein, the term "heterologous sequence" refers to a second 

nucleotide sequence present in a vector of the invention. The term 
] "heterologous sequence" also refers to any amino acid or RNA sequence 
encoded by a heterologous DNA sequence contained in a vector of the 
invention. Heterologous nucleotide sequences can encode polypeptides or 
RNA molecules normally expressed in the cell type in which they are present 
or molecules not normally expressed therein (e.g., Sindbis structural proteins). 

[0098] As used herein, the phrase "untranslated RNA" refers to an RNA 

sequence or molecule which does not contain an open reading frame or 
encodes an open reading frame, or portion thereof, but in a format in which an 
amino acid sequence will not be produced (e.g., no initiation codon is present). 
Examples of such molecules are tRNA molecules, rRNA molecules, and 
ribozymes. Antisense RNA may be untranslated but, in some instances 
antisense sequences can be converted to a translatable sense strand from which 

4 a polypeptide is produced. 

•J - 

[0099] As used herein, the phrase "temperature-sensitive" refers to an enzyme 

which readily catalyzes a reaction at one temperature but catalyzes the same 
reaction slowly or not at all at another temperature. An example of a 
temperature-sensitive enzyme is the replicase protein encoded by the pCytTs 
vector, which has readily detectable replicase activity at temperatures below 
34°C and has low or undetectable activity at 37°C. 

[00100] As used herein, the phrase "permissive temperature" refers to 

temperatures at which an enzyme has relatively high levels of catalytic 
activity. 

[00101] As used herein, the phrase "restrictive temperature" refers to 
temperatures at which an enzyme has low or undetectable levels of catalytic 
activity. Both "hot" and "cold" sensitive mutants are known and, thus, a 
restrictive temperature may be higher or lower than a permissive temperature. 

[00102] As used herein, the term "recombinant host cell" refers to a host cell 
into which one ore more nucleic acid molecules of the invention have been 
introduced. 
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[00103] As used herein, the term "replication initiation factor" refers to a 
protein or a DNA sequence encoding such a protein that is able to recognize 
an origin of replication leading to replication of plasmids containing this 
origin of replication. 

[00104] As used herein, the term "origin of replication" refers to a DNA 

sequence that is recognized by a replication initiation factor or a DNA 
replicase leading to replication of a plasmid containing the origin of 
replication. The expression "recognized by a replication initiation factor" is 
intended to mean that a replication initiation factor can physically interact with 
all or a portion of an origin of replication sequence, thereby causing or 
stimulating molecular mechanisms that ultimately cause all or a portion of the 
DNA molecule comprising the origin of replication to be replicated. 

[00105] As used herein the term "inducible" means that in an inactive state, the 

based activity of a system should be minimal, while in the active state high 
levels of gene expression should be rapidly inducible. 

[00106] When the terms "one," "a," or "an" are used in this disclosure, they 
mean "at least one" or "one or more," unless otherwise indicated. 

NUCLEIC ACID MOLECULES AND EXPRESSION SYSTEMS 

[00107] Certain preferred embodiments of the present invention are based on 

the vectors used in the previously described pCytTs expression system (WO 
99/50432, the disclosure of which is hereby incorporated by reference in its 
entirety). These vectors are constitutively transcribed in host cells to produce 
mRNA molecules having two open reading frames. These open reading 
frames, which may or may not be produced from the same nucleic acid 
molecule, encode a temperature-sensitive replicase and a heterologous gene of 
interest. The first open reading frame is translated to produce an 
RNA-dependent RNA polymerase. The second open reading frame, encoding 
all or part of one or more polypeptides of interest, is not translated until after 
at least one RNA-dependent RNA replication event. 
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[00108] The nucleic acid molecule of the present invention comprise, inter alia, 

at least one second nucleotide sequence. The second nucleotide sequence is 
preferably operably linked a promoter which is recognized by an RNA- 
dependent RNA polymerase. The second nucleotide sequence, in certain 
embodiments, may comprise a second open reading frame, or other nucleotide 
sequence of interest, as well as other elements. For instance, the second 
nucleotide sequence may comprise a 5* promoter which is capable of initiating 
synthesis of RNA in vivo, 5 1 and/or 3' sequences enabling replication of the 
RNA molecule (5 1 and 3 1 cis acting sequence elements), as well as a sequence 
of interest which is translatable only after at least one replication event. 
Replication is catalyzed by an RNA-dependent RNA polymerase which is 
encoded alternatively on the same or on a different mRNA molecule. The 
sequence of interest may be encoded in sense, plus (+), orientation 
downstream of a viral RNA promoter. Translation of the coding sequence of 
the gene of interest is inhibited by a 5' sequence which, in the case of the 
single-vector system, will generally be the replicase sequence. In the 
multiple-vector system, a 5' sequence can inhibit translation by having one or 
more short open reading frames with associated stop codons which lead to the 
detachment of ribosomes. Similarly, any sequence which inhibits the traveling 
or binding of ribosomes to the sequence of interest can be used as a 5' 
sequence which inhibits translation (Voet and Voet, BIOCHEMISTRY, John 
Wiley & Sons, Inc. (1990)). 

[00109] Another method for preventing translation of nucleotide sequences in 
most biological systems involves the insertion of the sequence in an antisense 
direction. This method of inhibiting translation is based on the principle that 
translation will generally only occur after the replication of this minus (-) 
strand RNA into a plus strand having an open reading frame in a sense 
orientation. The translated sense strand is formed by RNA replication and 
serves as a template for ribosomes and protein synthesis. As shown 
previously (WO 99/50432) production of amino acid sequences can occur 
even when the gene of interest is inserted into the DNA molecule in an 
orientation which will result in the formation of antisense RNA sequence 3' to 



-28- 

the subgenomic promoter. Thus, the second open reading frame may also 
comprise a sequence complementary to all or part of the second open reading 
frame described above and expression of the encoded amino acid sequence 
will still occur. When the production of an untranslated antisense RNA 
sequence is desired, the RNA molecule can be designed so that it will not 
serve as a template for protein synthesis. For example, the RNA can be 
designed so that an initiation codon is not present. 

[00110] The second nucleotide sequence may alternatively or additionally 
comprise a nucleotide sequence encoding an untranslated RNA molecule or 
complement thereof. For example, RNA molecules directly produced by 
transcription of a DNA sequence of the invention may encode RNA sequences 
which are neither translated nor present in an antisense orientation. Examples 
of such untranslated RNA molecules include tRNA molecules, rRNA 
molecules, and ribozymes. A considerable number * of ribozyme sequences 
with defined catalytic activities are known in the art (see, e.g., Brown, J., 
Nucleic Acids Res. 26:351-352 (1998); Xie, Y. et at, Proc. Natl Acad. Set. 
USA P¥:13777-13781 (1997); Lavrovsky. Y et al, Biochem. Mol Med. 
(52:11-22 (1997); Chapman, K. and Szostak, J., Chem. Biol 2:325-333 
(1995)). Further, ribozymes have been used to "knockout" the expression of a 
specific gene in eukaryotic cells as part of a ribozyme-mediated, message 
deletion strategy (Xie, Y. et al, Proc. Natl Acad ScL USA 94:13777-13781 
(1997)). Additionally, alphaviral replicons have been used to express a 
functional ribozyme in mammalian cells (Smith S. et al t J. Virol 
77:9713-9721 (1997)). The regulated expression of such ribozymes, and other 
untranslated RNA molecules, is thus within the scope of the present invention. 

[00111] The nucleic acid molecules and expression systems of the invention 
also comprise a second polynucleotide element comprising an origin of 
replication. Origins of replication that can be used with the present invention 
include, e.g., those that are derived from prokaryotic organisms, eukaryotic 
organisms (e.g., yeasts, insects, or mammals) and/or viruses. For example, an 
origin of replication that may be used with the present invention is one that is 
derived from a DNA virus such as, e.g., a Herpesvirus, a Papillomavirus, an 
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Adenovirus, or a Hepadnavirus. In certain embodiments, the origin of 
replication that is used with the invention is a DNA virus that allows for 
episomal replication. In a preferred embodiment, the origin of replication is 
derived from Epstein-Barr virus (EBV), such as, e.g., oriP. 

[00112] The nucleic acid molecules and expression systems of the invention 
also comprise a third polynucleotide element encoding a replication initiation 
factor. The replication initiation factor will be capable of recognizing the 
origin of replication that is included within the nucleic acid molecule or 
expression system. The third polynucleotide element of the invention may 
encode, e.g., a replication initiation factor derived from prokaryotic organism, 
eukaryotic organism (e.g., yeasts, insects, and/or mammals) and/or viruses. 
For example, a replication initiation factor that may be used with the present 
invention is one that is derived from a DNA virus such as, e.g., a Herpesvirus, 
a Papillomavirus, an Adenovirus, or a Hepadnavirus. In certain embodiments, 
the replication initiation factor that is used with the invention is capable of 
operating as a plasmid maintenance factor. In a preferred embodiment, the 
replication initiation factor is derived from Epstein-Barr virus (EBV), Such as, 
e.g., the EBNA-1 protein, or a portion thereof. 

[00113] The replication initiation factor {e.g., a nucleotide sequence that 
encodes a replication initiation factor) and the origin of replication that are 
included within the nucleic acid molecules and expression systems of the 
present invention may be derived from the same organism or from the same 
virus. Alternatively, the replication initiation factor and the origin of 
replication may be derived from different organisms or from different viruses. 

[00114] In an exemplary embodiment of the invention, nucleic acid molecules 
are provided that are constructed by introducing defined Epstein-Barr virus 
(EBV) sequences into vectors such as, e.g., those included within the pCytTs 
system. The resulting exemplary nucleic acid molecules of the invention are 
capable of replicating as non-integrated autonomous episomal molecules in 
the transformed host cells. 

[00115] The defined sequences, which may be introduced into the vectors of 
the pCytTs system, comprise, e.g., either one or two elements of Epstein-Barr 
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virus, (OriP alone or together with EBNA-l gene) which permit plasmid 
maintenance. The origin of replication OriP is a cis-acting sequence and needs 
to be inserted into the plasmid vector sequence. Plasmids containing this 
origin of replication are able to be maintained in cells expressing the 
replication initiation factor which recognizes the origin of replication; one of 
these factors is EBNA-1. EBNA-1 gene function can be provided in cis by 
introducing the sequence into the same plasmid or can be provided in trans by 
co-transfection with a second replicating or non-replicating plasmid as well as 
by providing it from stably transduced cell lines expressing EBNA-1 gene 
from an integrated copy. Furthermore, in certain embodiments, more than one 
copy of the sequence expressing the replication initiation factor, e.g. EBNA-1, 
is provided. For example, EBNA-1 gene function may be provided on one or 
more plasmids, wherein none, some or all of those plasmids may additionally 
comprise the origin of replication, and which plasmids are used for 
transfecting cell lines, preferably cell lines that have the sequence of EBNA-1 
stably integrated within its genome. The presence of both OriP and EBNA1 
sequences in the same plasmid renders replication less dependent on the host 
cell type. (Reisman, D. et aL, Mol Cell Biol 5: 1822-1832 (1985); Yates, J.L. 
et aL, Nature 575:812-815 (1985); U.S. Patent No. 4,686,186). U.S. Patent 
No. 4,686,186 describes the transfection of cells with a single plasmid 
containing the EBV OriP, the EBNA-1 gene and a gene encoding a protein of 
interest. 

[00116] Although such cell lines have the advantage of stable long-term 
expression of the replication initiation factor and durable support of replication 
and maintenance of OriP containing plasmids, there are not many EBNA-1 
expressing cell lines commercially available (ATCC : 293HEK-EBN A 1 and 
CV1-EBNA1). Alternatively, plasmids that already carry the EBNA-1 gene 
and the gene of interest in cis on the same episome are used to transfect cells 
and commercial vectors such as pCEP4 (Invitrogen) are available. However, 
current vectors designed for constitutive expression of polypeptides carrying 
OriP and EBNA-1 on the episomal construct in cis and wherein the expression 
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is neither regulatable or inducible, are not applicable in cases where cell-toxic 
polypeptides need to be expressed. 
[00117] To our knowledge, an inducible expression system having the features 
of a self-replicating DNA episome and that allows the expression of a 
polypeptide or untranslated RNA sequence of interest from a self-replicating 
RNA molecule, triggered by a simple temperature shift, has not been 
described. 

[00118] Certain embodiments of the invention are directed to nucleic acid 
molecules which are transcribed to produce a mRNA molecule having two 
open reading frames, e.g. 9 one open reading frame which encodes a replicase, 
and another open reading frame which encodes a nucleotide sequence of 
interest The nucleic acid molecules of the invention may contain a promoter 
sequence which drives transcription to produce mRNA molecules having 
coding sequences of both open reading frames. The mRNA sequences of the 
first open reading frame are preferably translated to produce a replicase 
required for the expression of the RNA sequences of the second open reading 
frame. The second open reading frame preferably encodes one or more 
polypeptides of interest. In addition, the vectors contain an origin of 
replication and a DNA polymerase recognizing the origin of replication. 

[00119] Further, once the first mRNA molecule has been transcribed from the 
DNA vector, additional RNA-dependent RNA replication events may occur to 
amplify the first mRNA sequence and to produce RNA molecules with strand 
polarity which is the opposite of the first mRNA sequence. 

[00120] The second open reading frame of the nucleic acid molecules of the 
invention will preferably only be expressed after partial replication of a full- 
length RNA molecule. This partial replication of the full-length RNA 
molecules is driven by a promoter sequence composed of RNA {e.g., an 
alphaviral subgenomic promoter sequence). In addition, the plasmid encoding 
the replicase and the gene of interest will preferably replicate during division 
of the host cell due to the presence of origin of replication and a protein factor 
recognizing the origin of replication. 
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[00121] While the gene of interest may be encoded by the same RNA molecule 

as the RNA-dependent RNA replicase protein, this gene may also be encoded 
by a separate RNA molecule. Thus, the invention further provides both 
single- and multiple-vectors systems for expressing a gene of interest. 
Moreover, while all plasmids used preferably contain an origin of replication, 
the gene encoding the replication initiation factor may be on a separate 
plasmid or stably integrated into the genome of the host cell. 

[00122] In a single-vector system of the invention, sequences encoding the first 
open reading frame, the second nucleotide sequence are components of the 
same nucleic acid molecule, the origin of replication and the initiation factor. 
Thus, all of the components required for regulated expression of the gene of 
interest and plasmid amplification are contained in a single nucleic acid 
molecule (le. , DNA or RNA). 

[00123] In a multiple-vector system of the invention, sequences encoding the 
first open reading frame, or sub-portions thereof, and the second nucleotide 
sequence are components of different nucleic acid molecules. These 
multiple-vector systems thus may comprise two or more nucleic acid 
molecules. For example, the replicase and the gene of interest can each be 
encoded by different nucleic acid molecules. However, all vectors may 
contain an origin of replication. 

[00124] While any functional promoter can be used to drive the transcription of 
mRNA from the nucleic acid molecule of the invention, the promoter is 
preferably a constitutive RNA polymerase II promoter (e.g., Rous Sarcoma 
Virus (RSV), cytomegalovirus (CMV), simian virus 40 (SV40), 
myeloproliferative sarcoma virus (MPSV), glucocorticoid, metallothionein, 
Herpes simplex virus thymidine kinase (HSVTK), human immune deficiency 
(HIV), mouse mammary tumor virus (MMTV), human polyomavirus BK 
(BKV), or Moloney murine leukemia virus (MuLV) promoter). Additional 
promoters suitable for use in the practice of the present invention are known in 
the art {see, e.g., Lee, A. et al t Mol Cells. 7:495-501 (1997)). 

[00125] The nucleic acid molecules of the invention may further comprise a, 
fourth polynucleotide element, wherein said fourth polynucleotide element 
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comprises a selection marker. The selection marker may facilitate the cloning 
and amplification of the vector sequences in prokaryotic and eukaryotic 
organisms. In certain embodiments, the selection marker will confer 
resistance to a compound or class of compounds, such as an antibiotic. An 
exemplary selection marker that can be used with the nucleic acid molecules 
and expression systems of the present invention is one that confers resistance 
to puromycin. 

[00126] The pCytTs vector contains an ampicillin resistance marker for 
positive selection in bacterial host cells and an E. coli origin of replication 
(i.e., ColEl). A considerable number of sequences encoding additional 
selection markers and origins of replication are known in the art (see, e.g., 
Sambrook, J. et al, eds., Molecular Cloning, A Laboratory Manual, 
2nd. edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 
(1989); Ausubel, F. et aL, eds., Current Protocols in Molecular 
Biology, John H. Wiley & Sons, Inc. (1997)). 

[00127] The nucleic acid molecules and expression systems of the invention 
comprise a first nucleotide sequence comprising a first open reading frame, 
said first open reading frame having a nucleotide sequence encoding a 
replicase, e.g., an RNA-dependent RNA polymerase. The first nucleotide 
sequence may, in certain embodiments, also comprise 5 f and 3 f exacting 
sequences, and junction sequences containing a subgenomic promoter. 

[00128] The RNA-dependent UNA polymerases that can be used with the 
present invention may be of viral origin.. The RNA-dependent RNA 
polymerase may be derived, e.g., from virus selected from the group 
consisting of: Bebaru virus, Cabassou virus, Chikungunya virus, Easter equine 
encephalomyelitis virus, Fort morgan virus, Getah virus, Kyzylagach virus, 
Mayoaro virus, Middleburg virus, Mucambo virus, Ndumu virus, Pixuna 
virus, Tonate virus, Triniti virus, Una virus, Western equine encephalomyelitis 
virus, Whataroa virus, Venezuelan equine encephalomyelitis virus (VEE), and 
Ross River virus. In a preferred embodiment, the RNA-dependent RNA 
polymerase of the invention is derived from an alphavirus such as, e.g., a 
Sindbis virus, a Semliki Forest virus, and/or an Aura virus. 
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[00129] In certain embodiments of the invention, the RNA-dependent RNA 
polymerase that is encoded by the first open reading frame of the nucleic acid 
molecules of the invention will be a non-cytopathic RNA-dependent RNA 
polymerase. The RNA-dependent RNA polymerase may, alternatively, be a 
temperature sensitive RNA-dependent RNA polymerase. In other 
embodiments, the RNA-dependent RNA polymerase may be a non-cytopathic, 
temperature-sensitive RNA-dependent RNA polymerase. 

[00130] When the RNA-dependent RNA polymerase that is encoded by the 
first open reading frame of the nucleic acid molecules of the invention is a 
temperature-sensitive RNA-dependent RNA polymerase, the RNA-dependent 
RNA polymerase may, e.g., have replicase activity at temperatures below 
34°C and low or undetectable replicase activity at 34°C or above. In certain 
embodiments, the temperature sensitive RNA-dependent RNA polymerase 
will be one that has replicase activity at 34°C which is at least five fold lower 
than the replicase activity exhibited at 29°C. 

[00131] Preferred mutations which confer a non-cytopathic phenotype are those 
that are in the nsp2 gene (eg., the proline residue at position 726 is replaced 
with a serine or a leucine residue). Mutations are known in the art which 
render the replicase protein non-cytopathic (Weiss et aL, J. Virol 35:463-474 
(1980); Dryga et aL, Virology 225:74-83 (1997); Agapov et aL, PNAS 
95:12989-12994 (1998)). In addition, mutations may be introduced into the 
nuclear localization signal of an alphaviral replicase which diminish toxicity 
of the replicon by, e.g., modifying the distribution of the replicase-encoding 
sequence between the cytoplasm and the nucleus. (Rikkonen et aL, Virology 
275:352-361 (1996)). 

[00132] Mutations which render a replicase non-cytopathic may be introduced 
by a number of means, including site directed mutagenesis. Additional 
mutations that render an RNA-dependent RNA polymerase non-cytopathic can 
be identified by, e.g., random mutagenesis and screening using methods and 
techniques that are well-known in the art. As will be understood by persons 
having ordinary skill in the art, the identification of additional mutations that 
render an RNA-dependent RNA polymerase non-cytopathic, temperature 
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sensitive, or both, does not require an ability to predict the structural or 
functional consequences of any particular mutation. 

[00133] About 20 years ago Weiss et al (Weiss, B. et al t J. Virol 35:463-474 
(1980)) established a persistently infected culture of BHK cells. The mutation 
responsible for this phenotype has been recently identified (Dryga, S.A. et al, 
Virology 225:74-83 (1997)). Another mutation allowing the regulation of the 
mRNA transcription via temperature shifts was identified by Burge and 
Pfefferkorn (Burge, B.W. & Pfefferkorn, E.R., Virology 30:203-214 (1966)) 
and described in more detail by Xiong et al. (Xiong, C. et al, Science 
243:1188-1191 (1989)). 

[00134] Temperature sensitivity (ts) may be conferred, for example, by the 
introduction of a mutation in the nsp4 gene of the replicase. Preferably, 
mutations which confer a temperature-sensitive phenotype upon replicase 
activities are in a protein in complementation group F (Lemm et al, J. Virol 
54:3001-3011 (1990)). For example, a temperature-sensitive phenotype may 
be conferred by changing Gly 153 of nsp4 to Glu. Additionally, any other 
mutation which renders replicase activity temperature-sensitive can be used in 
the practice of the invention. Methods for creating and identifying new 
temperature-sensitive mutants are described by Pfefferkorn (Burge and 
Pfefferkorn, Virol 30:204-213(1966); Burge and Pfefferkorn, Virol 
30:214-223 (1966)). Other methods will be appreciated by those of ordinary 
skill in the art. 

[00135] While most temperature-sensitive mutants are "hot" sensitive, "cold" 
sensitive mutations are also known {see, e.g., Schwer, B. et al, Nucleic Acids 
Res. 26:803-809 (1998), Mathe, E. et al, J. Cell Set 7ii:887-896 (1998), 
Doedens, J. et al, J. Virol 77:9054-9064 (1997), Patterson, B. et al, J. Biol 
Chent 272:27612-27617 (1997)). The temperature-sensitive replicase may be 
"cold" or "hot" sensitive and thus will catalyze RNA replication only at 
temperatures either above or below restrictive temperatures. In one 
embodiment, RNA replication occurs at detectable levels only at temperatures 
lower than 34°C. In contrast to all previously known regulatable DNA 
expression systems, the basal level of expression in recombinant host cells 
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containing the pCytTs vector in the inactive state at 37°C is below the level of 
detection using standard methods (e.g., those used in the following examples) 
even under condition of transient transfection. This low level of expression is 
apparent from the data presented in Fig.4. 

[00136] The production of additional temperature sensitive and/or non- 
cytopathic replicases has been described in the art. (Lundstrom et al, Gene 
Ther. Mol Biol. 4:23-31 (1999), Lundstrom et al, Histochem. Cell Biol. 
775:83-91 (2001)). Moreover, with respect to alphaviruses, the results 
obtained in one type of alphavirus can often be used to predict the results 
obtained in other alphaviruses. That is, the identification of mutations that 
render a replicase temperature-sensitive and/or non-cytopathic in one 
alphavirus type may be used to identify or create a similar temperature- 
sensitive and/or non-cytopathic replicase in a different alphavirus. This ability 
is related, in part, to the high level of sequence identity observed among 
alphavirus genomes, especially in their replicase-encoding sequences. 
(Weaver et al, J. Virol 77:613-623 (1997), Smyth et al, J. Virol 77:818-823 
(1997), Kuhn et al, J. Virol 70:7900-7909 (1996)). 

[00137] In certain embodiments, the first nucleotide sequence (comprising a 
first open reading frame encoding an RNA-dependent RNA polymerase) and 
the second nucleotide sequence (comprising, e.g., a second open reading frame 
encoding a polypeptide) are contained on two separate nucleic acid molecules. 
In such an instance, the second nucleotide sequence may carry both cis-acting 
sequences and a 5* region which inhibits translation of the sequence of interest. 
The first nucleotide sequence can also be encoded by a nucleic acid molecule 
which is different than the second nucleotide sequence. Replication and 
translation of the second nucleotide sequence in this multi-vectors system can 
be regulatable by temperature if a temperature sensitive RNA-dependent RNA 
polymerase is used. 

[00138J Additional mutations that render an RNA-dependent RNA polymerase 
non-cytopathic, temperature-sensitive, or both, can be identified by, e.g., 
random mutagenesis and screening using methods and techniques that are 
well-known in the art. As will be understood by persons having ordinary skill 
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in the art, the identification of additional mutations that render an RNA- 
dependent RNA polymerase non-cytopathic, temperature sensitive, or both, 
does not require an ability to predict the structural or functional consequences 
of any particular mutation. The identification of an RNA-dependent RNA 
polymerase that is non-cytopathic, temperature-sensitive, or both, only 
involves, e.g., the generation of random mutations and the functional testing 
of the replicases that are encoded from the mutated nucleic acid molecules. 
Such screening methods are routine in the art. 

[00139] The nucleic acid molecules and expression systems of the invention 
can be also used to express of more than one gene of interest. For example, 
recombinant host cells can be transfected with more than one nucleic acid 
molecule of the invention wherein one nucleic acid molecule encodes both the 
replicase and a polypeptide of interest and additional nucleic acid molecules 
may encode additional polypeptides of interest. Similarly, when mutations 
conferring non-cytopathicity and temperature sensitivity are both used, genes 
encoding polypeptides having suitable mutations (e.g., Pro 726 Ser in nsp2 
and Gly 153 Glu in nsp4) may be on separate nucleic acid molecules. 
Additional variations would be apparent to those skilled in the art. 

[00140] The nucleic acid molecules of the invention may also contain packaging 
signals which direct the packaging of RNA molecules into viral particles. 
These RNA molecules can be packaged in the presence of wild-type virus or 
defective helper virus RNA. A significant improvement was made with the 
development of defective helper RNA molecules (Bredenbeek, P. et aL 9 J. 
Virol 67:6439-6446 (1993)). These RNA molecules contain cfc-acting 
sequences, required for replication of the full-length transcription product, and 
subgenomic RNA promoter sequences which drive the expression of the 
structural protein genes. For example, in cells containing both RNA 
molecules with packaging signals and the defective helper virus RNA, 
alphaviral non-structural proteins allow for replication and amplification of the 
defective helper virus RNA sequences which are translated to produce virion 
structural proteins. Since the helper virus RNA lacks packaging signals, these 
molecules are not packaged into assembled virions. Thus, virion particles 
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produced in this way contain essentially only RNA sequences encoding the 
gene of interest and, generally, other sequences required for 
temperature-sensitive regulation of gene expression. These non-infective 
packaged RNA molecules do not contain sequences encoding virion structural 
proteins and, thus, undergo only one round of host cell infection and are not 
pathogenic. ; 

[00141J Non-infective packaged RNA molecules can be used to infect a culture of 
suitable host cells simply by adding the particles to culture medium containing 
these cells. The preparation of non-infective alpahviral particles is described 
in a number of sources, including "Sindbis Expression System", Version C, 
(Invitrogen Catalog No. K750-1). 

[00142] One exemplary application of this system is directed to the temperature- 
dependent production of non-infective, packaged RNA molecules. These 
packaged RNA molecules may be produced by a number of means including 
using recombinant host cells containing two different nucleic acid molecules 
{e.g., a nucleic acid molecule of the invention and a nucleic acid molecule 
encoding a helper virus RNA sequence). For example, one of these nucleic 
acid molecules will encode, e.g., an RNA molecule which contains packaging 
signal sequences, sequences encoding a non-cytopathic, temperature-sensitive 
replicase, and the gene of interest. The other nucleic acid molecule will 
contain sequences encoding alphaviral structural proteins downstream from an 
alphavirus subgenomic promoter. Using such a system, viral particles 
containing only RNA molecules with packaging signals will be produced at 
permissive temperatures in recombinant host cells. This is so because 
alphaviral structural proteins will only be produced at a permissive 
temperature. Additional variations of the above would be apparent to one 
skilled in the art. 

[00143] The second nucleotide sequence that is included within the nucleic acid 
molecules and expression systems of the invention may be, e.g., a second open 
reading frame encoding a polypeptide. Such a second open reading frame 
may, alternatively or interchangeably, be referred to as a nucleotide sequence 
of interest. A wide variety of nucleotide sequences of interest can be expressed 
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by the nucleic acid molecules and expression systems of the invention. These 
sequences include, but are not limited to, sequences encoding lymphokines, 
cytokines, toxins, enzymes, prodrug converting enzymes, antigens which 
stimulate immune responses, single chain antibodies, polypeptides which 
stimulate or inhibit immune responses, tumor necrosis factors, and various 
proteins with therapeutic applications (e.g., growth hormones and regulatory 
factors). 

[00144] In certain embodiments, the second open reading frame (the second 

nucleotide sequence) of the invention may encode, e.g., a. cytokine or 
lymphokine (e.g., p-interferon). Hematopoiesis is regulated by lymphokines 
and cytokines which stimulate the proliferation and/or differentiation of 
various hemopoietic cells. Representative examples of cytokines and 
lymphokines include interleukin-1 (IL-1), interleukin-2 (IL-2), interleukin-3 
(IL-3), interleukin-4 (IL-4), interleukin-5 (IL-5), interleukin-6 (IL-6), 
interleukin-7 (IL-7), interleukin-8 (IL-8), interleukin-9 (IL-9), interleukin-1 0 
(IL-10), interleukin-1 1 (DL-11), ihterleukin-12 (IL-12), interleukin-1 3 (IL-13), 
interleukin-14 (IL-14), interleukin-1 5 (IL-15), interleukin-1 6 (IL-16), 
interleukin-1 7 (IL-1 7), granulocyte colony stimulating factor (G-CSF), 
granulocyte-macrophage colony stimulating factor (GM-CSF), macrophage 
colony stimulating factor (M-CSF), and interferons. 

[00145] In certain embodiments, the second open reading frame (the second 

nucleotide sequence) of the invention may encode secreted enzymes (e.g., 
secreted alkaline phosphatase), cytoplasmic enzymes (e.g., green fluorescent 
protein), or any number of other proteins with therapeutic applications (e.g., 
human insulin, human coagulation Factor VIII). 

[00146] The nucleic acid molecules and expression systems of the invention 
can also be used to express heterologous sequence encoding cytotoxic 
polypeptides. Cytotoxic polypeptides act to directly or indirectly inhibit 
cellular growth or metabolism. Representative examples of toxins include 
Shigella toxin, ricin, Diphtheria toxin, Cholera toxin, Pseudomonas 
exotoxin A, and Herpes simplex virus thymidine kinase (HSVTK). Within 
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other embodiments of this invention, the heterologous sequence encodes a 
prodrug converting enzyme. A prodrug converting enzyme activates a 
compound with little or no cytotoxicity into a toxic product. Representative 
examples are HSVTK, alkaline phosphatase, guanine phosphoribosyl 
transferase, and penicillin- V amidase. Examples of both cytotoxic 
polypeptides and prodrug converting enzymes are discussed in numerous 
sources including PCT/US97/06010, EP 0716148, and WO 96/17072. In 
addition, a vast array of signaling molecules and membrane proteins are toxic 
if expressed at high levels. All these molecules may be suitable for expression 
using the system of the present invention. 

[00147] The second nucleotide sequence of the nucleic acid molecules and 
expression systems of the invention may also be a nucleotide sequence 
encoding an untranslated RNA molecule or complement thereof. Exemplary 
untranslated RNA molecules include, e.g., antisense sequences, RNase P 
targeted sequences which induce gene down-regulation, and ribozymes. 
Smith S. et al (J. Virol 77:9713-9721 (1997)) describes alphaviral vectors 
used to express ribozyme sequences. 

[00148] The nucleic acid molecules and expression systems of the invention 
can also be used to express virtually any polypeptide, including ones which 
have not as yet been identified but are encoded by nucleotide sequences 
contained in, for example, cDNA libraries or host cell chromosomes. 
Example of such polypeptides include secreted proteins and proteins from 
various cellular compartments. Heterologous sequences expressed by the 
vectors of the invention can encode polypeptides and RNA molecules from 
non-human species {e.g., other mammals, plants, fungi, bacteria or viruses). 
These heterologous sequences may further encode viral membrane proteins 
(e.g., HIV gpl60) or viral polyproteins (e.g., Sindbis structural proteins). 

[00149] Nucleotide sequences may be added to the nucleic acid molecules and 
vectors of the invention which result in the production of a fusion protein. For 
example, such sequences can encode amino acids sequences which are fused 
to a protein encoded by a gene of interest and confer one or more functional 
characteristics upon the expression product. These amino acid sequences 
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include sequences which will target the gene product for export from the cell 
(e.g., a secretory sequence) or to a subcellular compartment (e.g., the nucleus). 
Such amino acid sequences further include sequences which facilitate 
purification (e.g., a six His "tag"). Depending on the amino acid sequence and 
the function imparted by the fused sequence, the added amino acid sequences 
may or may not be cleaved from the translation product. 
[00150] Fusion proteins also include proteins which have domains or regions 
derived from various different proteins. Examples of such a fusion protein are 
those containing domain II of Pseudomonas exotoxin, a domain or amino acid 
sequence which has binding affinity for a cell surface receptor associated with 
a particular cell type, and another amino acid sequence with a preselected 
biological activity. Domain II of Pseudomonas exotoxin will translocate 
across cell membranes. Using this system, fusion proteins can be designed 
which will bind to specific cells types, will translocate across the cytoplasmic 
membranes of these cells, and will catalyze predetermined intracellular 
biological reactions. A system of this type is described in Pastan et al, U.S. 
Patent No. 5,705,163. Methods for identifying amino acid sequences, which 
bind to specific cell types are described in Wu, A., Nature Biotech 14:429- 
431 (1996). 

[00151] The nucleic acid molecules and expression systems of the invention 

may, in certain embodiments, fiirther comprise genetic elements which confer 
additional functional characteristics such as selection markers. 

[00152] Markers for the selection of prokaryotic and eukaryotic cells 
containing vectors the present invention are well known in the art and include 
puromycin, tetracycline, ampicillin, neomycin, and kanamycin resistance. 
Nucleotide sequences which result in high copy number amplification are also 
known in the art and include the ColEl sequence contained in the pCytTs 
vector. 



RECOMBINANT HOST CELLS 

[00153] The invention includes methods of making recombinant host cells and 

recombinant host cells produced using the methods of the invention. For 
example, the methods of the invention comprise introducing one or more 
nucleic acid molecules or expression systems described herein into a host cell. 
A variety of different recombinant host cells can be produced which contain 
the nucleic 4 acid molecules and expression systems of -the invention. 
Alphaviruses, for example, are known to have a wide host range. Sindbis 
virus, for example, infects cultured mammalian, reptilian, and amphibian cells, 
as well as some insect cells (Clark, H., J. Natl Cancer Inst 57:645 (1973); 
Leake, C, J. Gen. Virol 35:335 (1977); Stollar, V. in The Toga viruses, 
R.W. Schlesinger, Ed., Academic Press, (1980), pp.583-621). 

[00154] Thus, numerous host cells can be used in the practice of the invention. 

Representative host cells that may be used with the invention include, but are 
not limited to, bacterial cells, yeast cells, plant cells and animal cells. 
Preferred bacterial host cells include Escherichia spp. cells (particularly E. coli 
cells and most particularly E. coli strains DH10B, Stbl2, DH5, DB3, DB3.1, 
DB4 and DBS), Bacillus spp. cells (particularly B. subtilis and B. megaterium 
cells), Streptomyces spp. cells, Erwinia spp. cells, Klebsiella spp. cells, 
Serratia spp. cells (particularly S. marcessans cells), Pseudomonas spp. cells 
(particularly P. aeruginosa cells), and Salmonella spp. cells (particularly S. 
typhimurium and S. typhi cells). Preferred animal host cells include insect 
cells (most particularly Drosophila melanogaster cells, Spodoptera frugiperda 
Sf9 and S£21 cells and Trichoplusa High-Five cells), nematode cells 
(particularly C. eiegans cells), avian cells, amphibian cells (particularly 
Xenopus laevis cells), reptilian cells, and mammalian cells (most particularly 
NIH3T3, CHO, COS, VERO, BHK, HEK, other rodent cells, and human 
cells). Preferred yeast host cells include Saccharomyces cerevisiae cells and 
Pichia pastoris ceils. BHK, COS, Vero, HeLa and CHO cells are particularly 
suitable for the production of heterologous polypeptides because they have the 
potential to glycosylate heterologous proteins in a manner similar to human 
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cells (Watson, E. et al, Glycobiology 4:227, (1994)) and can be selected 
(Zang, M. et al, Bio/Technology 73:389 (1995)) or genetically engineered 
(Renner W. et al, Biotech Bioeng. 47:476 (1995); Lee K. et al. Biotech. 
Bioeng 50:336 (1996)) to grow in serum-free medium, as well as in 
suspension. 

[00155] Example 2 shows that the expression system of the present invention 

can function in BHK cells and 293HEK cells. Moreover, we have previously 
demonstrated that the pCytTs system works efficiently in additional cells lines 
such as CHO-K1 and COS-7 (Boorsma et at., Nature Biotech. 75:429-432 
(2000)). 

(00156] The nucleic acid molecules and/or expression systems of the invention 
may be introduced into host cells using well known techniques of infection, 
transduction, electroporation, transfection, and transformation. Exemplary 
methods include DEAE-dextran mediated transfection, transient transfection, 
microinjection, cationic lipid-mediated transfection, scrape loading and 
ballistic introduction. Methods for the introduction of exogenous DNA 
sequences into host cells are discussed in Feigner, P. et al. 7 U.S. Patent No. 
5,580,859. The nucleic acid molecules and/or vectors of the invention may be 
introduced alone or in conjunction with other nucleic acid molecules and/or 
vectors and/or proteins, peptides or RNAs. Alternatively, the nucleic acid 
molecules and/or expression systems of the invention may be introduced into 
host cells as a precipitate, such as a calcium phosphate precipitate, or in a 
complex with a lipid. Electroporation also may be used to introduce the 
nucleic acid molecules and/or expression systems of the invention into a host. 
Likewise, such molecules may be introduced into chemically competent cells 
such as E. coli. If the vector is a virus, it may be packaged in vitro or 
introduced into a packaging cell and the packaged virus may be transduced 
into cells. Hence, a wide variety of techniques suitable for introducing the 
nucleic acid molecules and/or vectors of the invention into host cells are well 
known and routine to those of skill in the art. Such techniques are reviewed at 
length, for example, in Sambrook, J., et al., Molecular Cloning, a Laboratory 
Manual, 2nd Ed., Cold Spring Harbor, NY: Cold Spring Harbor Laboratory 
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Press, pp. 16.30-16.55 (1989), Ausubel, F. aL, eds., Current Protocols 
in Molecular Biology, John H. Wiley & Sons, Inc. (1997), Chapter 16), 
Watson, J.D., et aL, Recombinant DNA, 2nd Ed., New York: W.H. Freeman 
and Co., pp. 213-234 (1992), and Winnacker, E.-L., From Genes to Clones, 
New York: VCH Publishers (1987), which are illustrative of the many 
laboratory manuals that detail these techniques and which are incorporated by 
reference herein in their entireties for their relevant disclosures. 
[00157] The invention is also directed to recombinant host cells comprising one 
or more nucleic acid molecules or expression systems of the invention. Also 
included within the present invention are in vitro cell cultures that comprise 
such recombinant host cells. Methods for producing recombinant host cells 
and cell cultures comprising the same are well-known in the art. 

PRODUCTION OF POLYPEPTIDES AND RNA MOLECULES 

[00158] The nucleic acid molecules, expression systems and recombinant host 

cells of the invention may be used for the production of proteins, polypeptides 
and RNA molecules, e.g., untranslated .RNA molecules. The methods of the 
invention may comprise, e.g., introducing one or more nucleic acid molecules 
or expression systems of the present invention into host cells to produce 
recombinant host cells, culturing the recombinant host cells under conditions 
suitable for expression of the polypeptide or untranslated RNA molecule, and 
recovering the polypeptide or untranslated RNA molecule. 

[00159] The invention also provides methods for regulating the expression of a 
polypeptide or an untranslated RNA molecule. The methods of the invention 
may comprise, e.g., (a) introducing one or more nucleic acid molecules or 
expression systems of the invention into a host cell to produce a recombinant 
host cell, wherein said nucleic acid molecules or expression systems comprise 
a first open reading frame having a nucleotide sequence encoding a 
temperature-sensitive RNA-dependent RNA polymerase, (b) growing said 
recombinant host cell under suitable culture conditions, and (c) changing the 
temperature of the recombinant host cell culture from: (i) a permissive 
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temperature to a restrictive temperature, or (ii) a restrictive temperature to a 
permissive temperature. 

[00160] The present invention also provides methods for producing 

polypeptides and RNA molecules, said methods comprising introducing one or 
more nucleic acid molecules or expression systems of the invention into 
recombinant host cells, expanding the cells at a non-permissive temperature 
and incubating these cells at a permissive temperature. In a related aspect, the 
invention provides purified polypeptides and RNA molecules produced 
according to the methods of the present invention. 

[00161] Depending on the molecule, which is expressed, it may be obtained 
either from the culture supernatant or by lysing the recombinant host cells. 

[00162] Polypeptides produced using the nucleic acid molecules and 

expression systems of the invention can be recovered and purified from 
recombinant cell cultures by methods known in the art including ammonium 
sulfate precipitation, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography, and high 
performance liquid chromatography. Methods for purifying proteins are 
described in numerous sources (see, e.g., Celis, J., ed., Cell BIOLOGY, 
Academic Press, 2 nd edition, (1998)). 

[00163] Untranslated RNA molecules produced using the nucleic acid 
molecules and expression systems of the invention can be recovered and 
purified from recombinant cell cultures by methods known in the art {see, e.g., 
Celis, J., ed., CELL BIOLOGY, Academic Press, 2 nd edition, (1998)). Methods 
for recovering and/or purifying RNA molecules include phenol/chloroform 
extraction, digestion with DNAses followed by precipitation of the undigested 
RNA molecules, and column chromatography (e.g., oligo dT column 
chromatography). Further, RNA molecules can be separated from other 
cellular material using the single-step guanidinium-thiocyanate-phenol- 
chloroform method described in Chomczynski and Sacchi, Anal Biochem. 
752:156-159(1987). 
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[00164] The overall cell culture process employing nucleic acid molecules and 

expression systems of the invention for the production of polypeptides and/or 
untranslated RNA molecules can be implemented in a variety of bioreactor 
configurations (e.g., stirred-tank, perfused, membrane enclosed, encapsulated 
cell, fluidized bed, and air-lift reactors) and scales (from laboratory T-flasks to 
thousands of liters), chosen to accommodate the requirements of the host cell 
line utilized (e.g., anchorage dependency, 0 2 concentrations), to maximize the 
production of expression product, and to facilitate subsequent recovery and 
purification of expression product. 

[00165] The invention is also directed to the production of polypeptides or 

RNA molecules of interest using mammalian cells grown in serum-free or 
protein-free culture media. For example, by long-term culture under 
conditions restricting serum access or selecting for suspension growth, CHO 
cell lines are selected which are able to grow in serum-free medium and/or in 
suspension (Zang. M. et al., Bio/Technology 73:389 (1995)). 

[00166] Further, a number of different bioprocess parameters can be varied in 

order to alter the glycosylation pattern of polypeptide products produced by 
the recombinant host cells of the invention. These factors include medium 
composition, pH, oxygen concentration, lack or presence of agitation, and, for 
the case of anchorage-dependent cells, the surface provided. Thus, the 
glycosylation pattern of glycoproteins may be altered by choosing the host cell 
in which these proteins are expressed in and the conditions under which the 
recombinant host cells are grown. 

PHARMACEUTICAL COMPOSITIONS 

[00167] The invention farther provides pharmaceutical compositions 
comprising nucleic acid molecules and/or expression systems and/or 
recombinant host cells of the invention. The pharmaceutical compositions of 
the invention may comprise nucleic acid molecules and/or expression systems 
and/or recombinant host cells of the invention in combination (e.g., in 
solution) with a physiologically acceptable carrier and in a therapeutically 
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effective amount. The administration of these pharmaceutical compositions 
may, for example, result in expression of a polypeptide in tissues of an animal 
which is immunogenic and intended to function as a vaccination. Similarly, 
the nucleic acid molecules and/or expression systems and/or recombinant host 
cells of the invention may carry sequences that encode polypeptides or RNA 
molecules required for the treatment of an active affliction. The 
administration of a pharmaceutical composition of the invention will thus be 
intended to have a therapeutic effect in these instances. 

[00168] The nucleic acid molecules and/or expression systems and/or 
recombinant host cells of the invention will normally be administered to an 
individual in a pharmacologically acceptable carrier. A composition is said to 
be "pharmacologically acceptable" if its administration can be tolerated by a 
recipient individual. Further, the composition of the invention will be 
adnrinistered in a "therapeutically effective amount" (e.g., an amount that 
produces a desired physiological effect). 

[00169] As would be understood by one of ordinary skill in the art, when the 
nucleic acid molecules and/or expression systems and/or recombinant host 
cells of the invention are administered to an individual, they may be in a 
composition which contains salts, buffers, adjuvants, or other substances 
which are desirable for improving the efficacy of the composition. Examples 
of materials suitable for use in preparing pharmaceutical compositions are 
provided in numerous sources including Remington's Pharmaceutical 
Sciences (Osol, A, ed., Mack Publishing Co., (1980)). 

[00170] The pharmaceutical compositions of the present invention can be 
administered by various art known means but will normally be administered 
by injection, infusion or other suitable physical methods. The compositions 
may alternatively be administered intramuscularly, intravenously, or 
subcutaneously. Components of compositions for administration include 
sterile aqueous (e.g., physiological saline) or non-aqueous solutions and 
suspensions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic 
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esters such as ethyl oleate. Carriers or occlusive dressings can be used to 
increase skin permeability and enhance antigen absorption. 
100171] When recombinant host cells are administered to an individual, the 

number of cells or nucleic acid molecules required to provide a therapeutically 
effective amount will vary with such factors as the individual's condition, the 
polypeptides or RNA molecules intended to be expressed, and the size of the 
individual. 

KITS 

[00172] The invention also provides kits comprising the isolated nucleic acid 
molecules, expression systems and/or recombinant host cells of the invention. 
The kits of the invention may optionally comprise one or more additional 
components selected from the group consisting of one or more containers 
(e.g., boxes, vials, tubes, jars ampules, etc.) one or more vectors, one or more 
nucleotides, one or more primers, one or more polypeptides having 
polymerase activity, one or more host cells (e.g., host cells that may be 
competent for uptake of nucleic acid molecules), and one or more buffers. 

100173] It will be readily apparent to one of ordinary skill in the relevant arts 
that other suitable modifications and adaptations to the methods and 
applications described herein are obvious and may be made without departing 
from the scope of the invention or any embodiment thereof. Having now 
described the present invention in detail, the same will be more clearly 
understood by reference to the following examples, which are included 
herewith for purposes of illustration only and are not intended to be limiting of 
the invention. 
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EXAMPLE 1 

CONSTRUCTION OF THE pCytTs VECTOR SYSTEM 

[00174] Manipulations and sequencing of DNA were carried out by standard 
procedures. The mutations in nsP2 were introduced by PCR using the 
following oligonucleotides: 

[00175] oligo-nsp2 1: 5'-AACATTGAA ATCGATA TTACAGGGG (SEQ ID 
NO:2), 

[00176] oligo-nsp2 2: 5'-CGGGTTAT GGTCGAC CGGGC (SEQ ID NO:3), 
[00177] oligo-nsp2 3 :5'-GTGCCCTCCCCTGA GTTTAAAC AATTCAGGGC 

CGAACGCG (SEQ ID NO:4), and 
[00178] oligo-nsp2 4: S'-GAATT GTTTAAA CTCAGGAGGCACCCTCGTOG 

(SEQ ID NO:5). 

[00179] The single restriction sites used for first analysis and subsequent 
cloning (Dral, Clal and Salt) are underlined. PCR reactions were performed 
using either oligo-nsp2 1 (SEQ ID NO:2) and oligo-nsp2 3 (SEQ ID NO:4) or 
oligo-nsp2 2 (SEQ ID NO:3) and oligo-nsp2 4 (SEQ ID NO:5). 100 pmol of 
each oligo was used and 5 ng of the template DNA (pSinRepS; Xiong, C. et 
al, Science 243:1188-1191 (1989)) was used in the 100 /A reaction mixture, 
containing 4 units of Taq or Pwo polymerase, 0.1 mM dNTPs and 1.5 mM 
MgS0 4 . All DNA concentrations were determined photometrically using the 
GeneQuant apparatus (Pharmacia Biotech Inc., 800 Centennial Ave., 
Piscataway, NJ. 08854). The polymerase was added directly before starting 
the PCR reaction (starting point was 95°C). The temperature cycles were as 
follows: 95°C for 2 minutes, followed by 5 cycles of 95°C (45 seconds), 58°C 
(30 seconds), 72°C (90 seconds) and followed by 25 cycles of 95°C (45 
seconds), 68°C (30 seconds), 72°C (90 seconds). 

[00180] The two PCR fragment were purified using Qia spin PCR kit 
(QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311) and finally 
digested in an appropriate buffer using 20 units of Sail and Dral, respectively 
20 units of Clal and Dral. The digestion was performed for 12 hours at 
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37DC. The DNA fragments were gel-purified (Gene-Clean; Bio 101 Inc., 
1070 Joshua Way, Vista, CA, 92083, USA) and finally ligated into ClaVSaR 
digested and gel-purified SinRepS vector (Xiong, C. et ah, Science 243:1188- 
1191 (1989). The correct sequence of the obtained vector was checked by 
DNA sequencing of the whole nsP2 gene. 

[00181] The mutations in nsP4 were also introduced by PCR using the 
following oligonucleotides: 

[00182] oligo-nsp4 1: S'-GGTAGACGAGACAGT CGCATGCC TGGATAC 
(SEQIDNO:6), 

[00183] oligo-nsp4 2: S'-GTATCCA GGCATGC GACTGTCTCGTCTACC 
(SEQIDNO:7), 

[00184] oligo-nsp4 3: 5'-CAGACCG GTTAAC GCCATAGCG TCG (SEQ ID 
NO:8), and 

[00185] oligo-nsp4 4: 5'-CTCTAT TACTAGTA TGGACAGTTGG (SEQ ID 
NO:9), 

[00186] The singular restriction sites used for the first analysis and the final 
cloning step (Sphl, Hpal and Spel) are underlined. Two PCR reactions were 
carried out as described above using either oligo-nsp4 1 (SEQ ID NO:6) and 
oligo-nsp4 3 (SEQ ID NO:8) or oligo-nsp4 2 (SEQ ID NO:7) and oligo-nsp4 4 
(SEQIDNO:9). 

[00187] Both PCR products were gel-purified and then used in assembly PCR 
to amplify the whole nsP4 gene. For the assembly PCR, 50 pmol of the outer 
primers (3 and 4) and about 10 ng of each PCR fragment was used. The 
reaction volume was 100 ul, containing 4 units of Taq or Pwo polymerase, 0.1 
mM dNTPs and 1.5 mM MgS04 . The PCR conditions were as followed: 

[00188] 95°C for 2 minutes, followed by 5 cycles of 92°C (45 seconds), 58°C 
(30 seconds), 72°C (120 seconds) and followed by 25 cycles of 92°C (45 
seconds), 64°C (30 seconds), 72°C (120 seconds). 

[00189] The obtained PCR fragment was purified as described above and the 
eluate was digested with 20 units of Spel and Hpal in an appropriate buffer. 
The fragment was gel-purified and ligated into gel-purified SpeVHpal 
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restricted SinRep5 vector. The correct sequence of the obtained vector was 
checked by DNA sequencing. 
[00190J Overnight digestion of SinRep5-nsP4mut and SinRep5-nsp2mut with 
SpeVHpal and gel purification of the nsp4 fragment and sinRep-nsp2mut 
vector. The nsp4mut fragment was ligated into the SinRep5-nsp2mut vector. 
The final step was cloning the nsp gene into the 987/SinRep5 vector 
(Bredenbeek, P. etal., J. Virol. 67:6439-6446 (1993)) using Clal andHpal as 
restriction endonucleases, the resulting vector was named pCytTs (Fig. 1 and 
Figs. 2A-2D (SEQ ID NO:l)). 

EXAMPLE 2 

COMBINATION OF THE TEMPERATURE SENSITIVE pCytTs SYSTEM 
WITH THE SELF REPLICATING EBNA SYSTEM 

[00191] This system was generated in order to rapidly generate cell 
populations, which inducibly express a gene of interest. In this system, the 
tightly regulated pCytTs system is combined with the episomally replicating 
EBV system. 

A. Construction of various pCytTs constructs containing either the EBNA 
origin of replication (OriP) alone or in conjunction with the replication 
initiation factor EBNA1 . 

[00192] In order to be able to subclone either the origin of replication of 
Epstein-Barr virus alone or together with the replication initiation factor 
EBNA1 we first subcloned this two cassettes into a shuttle vector from which 
they can easily be transferred into our pCyt TS system with or without various 
inserts. The Epstein-Barr virus origin of replication OriP was amplified by 
PCR from pCEP4 (Invitrogen) using primers PH 51 and PH 52. The forward 
primer (PH 51) corresponds to nucleotides 261-280 of pCep4 and the reverse 
primer (PH 52) to nucleotides 2221-2239 of pCep4. PCR was performed using 
pfx polymerase (BRL) according to manufacturer's recommendations. The 
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resulting PCR product (nucleotides 261-2239 of P Cep4) was then cloned into 
pGEMTeasy vector (Invitrogen), which had previously been digested with Eco 
RI and treated with Klenow polymerase in order to get blunt ends. The 
resulting plasmid was termed pGemT-OriP. In order to get a subclonable 
cassette containing OriP and EBNA1 gene, pCep4 was digested with Dra III 
and Cla I. The resulting 4651 bp fragment was purified and blunt ends were 
generated by T4 DNA polymerase treatment. This fragment was then 
subcloned into pGEMTeasy vector (Invitrogen), which had previously been 
digested with Eco RI and treated with Klenow polymerase in order to get blunt 
ends. The resulting construct was termed pGemT-OPE. From these plasmids 
either the origin of replication alone (pGemT-OriP) or together with the 
replication initiation factor (pGemT-OPE) could be excised by Not I digestion 
and transferred into Not I sites of the various CytTs vectors. pCytTs-IFNp, 
pCytTs-EGFP, pCytTs-SEAP and ecCytTs-cEPO vectors were linearized with 
Not I and the Not I fragment of pGemT-OPE was introduced by ligation 
leading to the following constructs: pCytTs-IFNp-OPE, pCytTs-EGFP-OPE, 
pCytTs-SEAP-OPE and pCytTs-EPO-OPE. Similarly pCytTs-EPO-OriP and 
pCytTs-EGFP-OriP were generated by cloning the Not I fragment from 
pGemT-OriP into the Not I site of ecCytTs-cEPO and pCytTs-EGFP, 
respectively. 

[00193] Initial experiments were performed with a construct in which a slightly 
truncated form of OriP had been subcloned into pCytTs-EGFP. P Cep4 
(Invitrogen) was digested with Dra III and Nsi I and the 1923 nucleotide 
fragment (position 200-2124 from pCep4) was then treated with T4 
polymerase in order to get blunt ends. The blunted fragment was then 
subcloned into pCytTs-EGFP, which had previously been digested with Not I 
and blunted with T4 DNA polymerase. The resulting plasmid was termed 
pCytTs-EGFP-OriPd. 
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B. Analysis of pCytTs-EGFP-OriPd in 293 EBNA cells 

[001941 We first analyzed whether the introduction of Epstein-Barr virus 
derived origin of replication (OriP) into the pCytTs system would improve the 
inducible expression of a gene of interest on selected cell populations. We 
therefore transfected 293-EBNA cells (Invitrogen) with pCytTs-EGFP-OriPd. 
These cells constitutively express Epstein-Barr virus replication initiation 
factor EBNA1 and support OriP mediated DNA replication. 293-EBNA cells 
were transfected with pCytTs-EGFP-OriPd using Lipofectamine 2000 (BRL) 
according to the manufacturer's recommendation. 36h after transfection cells 
were split 1 to 3 in presence of ljug/ml puromycin. After 1 to 2 days all non- 
transfected cells had detached and the resistant cells were further expanded 
under puromycin selection for another week. In order to asses the expression 
from the new vectorj resistant cell populations were passed to two plates of 
which one was kept at 37°C (uninduced) and one was shifted to 29°C for 
induction of EGFP expression. 24h after induction cells were harvested by 
trypsinisation and analysed by flow cytometry (Fig. 3). Upon induction cell 
populations carrying pCytTs-EGFP-OriPd were to 80% EGFP positive and 
displayed strong EGFP expression. In contrast cells, which had not been 
induced showed only weak EGFP expression and only about 8% of the cells, 
were green. Considering that stable cell populations containing CytTs-EGFP 
would only yield about 2-5% EGFP positive cells upon induction (data not 
shown) these results suggest that the OriP, which was introduced in the new 
vector, is functional and that it significantly improves the number of cells in a 
cell population, which expresses the gene of interest. 

C. Comparison of pCytTs-EGFP, pCytTs-EGFP-OriP and pCytTs- 
EGFP-OPE in BHK cells 

[00195] The following constructs, pCytTs-EGFP, pCytTs-EGFP-OriP and 
pCytTs-EGFP-OPE, were next examined in BHK (Baby hamster kidney) cells 
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which are known to show tight regulation for the CytTs system. BHK cells 
were transfected with CytTs-EGFP, CytTs-EGFP-OriP and CytTs-EGFP-OPE 
using Lipofectamine 2000 (BRJL) according to the manufacturer's 
recommendation. 24h after transfection cells were split 1 to 3 in presence of 
5|ig/ml puromycin. After 2 to 3 days all non-transfected cells had detached 
and the resistant cells were further expanded under puromycin selection for 
another week. Resistant cell populations were passed to two plates of which 
one was kept at 37 °C (non-induced) and one was shifted to 29 °C for 
induction of replicon replication and EGFP expression. Three days after 
induction cells were harvested and analysed by flow cytometry (Fig. 4). About 
20% of the cell populations carrying CytTs-EGFP or CytTs-EGFP-OriP were 
EGFP positive upon induction, whereas four times more cells - up to 90% of 
the cells transfected with CytTs-EGFP-OPE did express EGFP after 
temperature induction. Thus, the inventive expression systems turned out to 
be very tight since less than 0.5% of the uninduced population showed weak 
EGFP expression. 

D. Inducible expression of pCytTs-IFNp-OPE and pCytTs-SEAP- 
OPE in BHK cells 

[00196] We further analyzed constructs which contain both OriP and the 

replication initiation factor EBNA1 (pCytTs-IFNp-OPE) with other genes of 
interest in BHK cells. These constructs do not depend on cell lines which have 
an integrated copy of the EBNA1 gene. As a gene of interest we choose 
IFNp, which had already been expressed with the conventional CytTs system. 
To test whether our new vector system with its episomal replication could 
result in high, inducible levels of expression on selected cell populations we 
transfected BHK cells with pCytTs-IFNp-OPE (see Example 2A) using 
Lipofectamine 2000 (BRL) according to the manufacturer's recommendation. 
36h after transfection cells were split 1 to 3 in the presence of 5jig/ml 
puromycin. After 1 to 2 days all non-transfected cells had detached and the 
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resistant cells were farther expanded under puromycin selection for another 1 - 
2 weeks. In order to assess the expression from the new vector, resistant cell 
populations were passed to two plates one of which was kept at 37°C 
(uninduced) and one was shifted to 29°C for induction of IFNp expression. 
Tissue culture supernatants were harvested from the uninduced control plate 
and 3, 4, 7 and 9 day after induction from the plate that had been shifted to 29 
°C. The IFNp production at the different time points was than quantitated by 
ELISA (Kit of Fujirebio Inc., Tokyo, Japan). The induced cell populations 
reached a maximal level of 1,000,000 IU/ml after 7-9 days of induction and no 
IFN production could be detected in cells kept at non-permissive temperature 
(Figs. 3A-3B). These results show that our new vector system combining the 
CytTs system with the episomaly replicating EBNA system leads to high 
production levels in BHK cells on selected populations. The expression levels 
are at least as high as the ones obtained from a stable cell clone, which 
resulted from 4 subcloning rounds (data not shown). In addition the new 
system is still tightly regulated as demonstrated by up to 300,000 fold 
induction observed upon shifting the cells to 29°C. 
[001971 Besides IFNp production we also analysed SEAP production with our 
new vector system in BHK cells. Selected cell populations were generated as 
described under C using construct pCytTs-SEAP-OPE. After two weeks of 
selection and propagation the resistant cells were passed into two plates of 
which one was kept at 37°C and the other was shifted to 29°C for induction. 
Samples were taken from the uninduced cell populations and from the induced 
cell population 3, 4, 7 and 9 days after induction. SEAP concentrations were 
measured in these samples using a kinetic ELISA for SEAP activity. The 
results are shown in Fig. 6. 
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E. Comparison of the EPO production derived from pCytTs-OriP- 
EPO or with pCytTs-OPE-EPO in 293 EBNA cells 

[00198] We further investigated whether an additional copy of the EBNA-1 
gene on the same plasmid leads to higher expression of the gene of interest in 
293-EBNA cells. 293-EBNA cells were transfected either with pCytTs-OriP- 
EPO or with pCytTS-OPE-EPO (see Example 2A) using Lipofectamin 2000 
(BRL) according to the manufacturer's recommendation. One day after 
transfection the cells were split 1 to 2.5 in the presence of 0.8 \ig/m\ 
puromycin. A stable cell pool was obtained by further keeping these cells 
under selective pressure for one week. Resistant cell populations were then 
passed to two plates. Once the cells had attached, one plate was kept at 37°C 
and one plate was shifted to 29°C for induction. Cell culture supernatants were 
collected from both plates 4 and 8 days after induction. The erythropoietin 
(EPO) levels in the cell culture supernatants were then determined with an 
ELISA kit (R&D Systems, Minneapolis, USA). As shown in Fig. 7 the 
additional copy of the EBNA1 gene on the plasmid leads to a 15 to 20 fold 
increase in the EPO production compared to the plasmid which only contains 
the OriP. In conclusion, this experiment shows that, even in cells which 
contain an integrated copy, the expression of the gene of interest can be 
increased by an additional copy on the plasmid. These results suggest that the 
EBNA1 dosage may be a limiting factor for efficient replication to occur. In 
these experiments the EBNA1 dosage was increased by a copy on the OriP- 
containing plasmid. Similar results may be obtained if the EBNA1 dosage is 
increased in the cells either by the integration of further copies or if the 
additional copies would be provided on a separate plasmid in trans. 
[00199] Having now fully described the present invention in some detail by 
way of illustration and example for purposes of clarity of understanding, it 
will be obvious to one of ordinary skill in the art that the same can be 
performed by modifying or changing the invention within a wide and 
equivalent range of conditions, formulations and other parameters without 
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affecting the scope of the invention or any specific embodiment thereof, and 
that such modifications or changes are intended to be encompassed within the 
scope of the appended claims. 
[00200] All publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to 
which mis invention pertains, and are herein incorporated by reference to the 
same extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 . A nucleic acid molecule comprising: 

(A) a first polynucleotide element which encodes an RNA 
molecule, said RNA molecule comprising: 

(a) at least one cw-acting sequence element; 

(b) a first nucleotide sequence comprising a first open 
reading frame, said first open reading frame having a 
nucleotide sequence encoding an RNA-dependent RNA 
polymerase; and 

(c) at least one second nucleotide sequence selected from 
the group consisting of: 

(i) a second open reading frame encoding a 
polypeptide; 

(ii) a nucleotide sequence complementary to all or a 
part of the second open reading frame of (i); and 

(iii) a nucleotide sequence encoding an untranslated 
RNA molecule or complement thereof; 

wherein said second nucleotide sequence is operably 
linked to a promoter which is recognized by said RNA- 
, dependent RNA polymerase; 

(B) a second polynucleotide element comprising an origin of 
replication; and 

(C) a third polynucleotide element encoding a replication initiation 
factor capable of recognizing said origin of replication. 

2. The nucleic acid molecule of claim 1, wherein said second open 
reading frame is in a translatable format after one or more RNA- 
dependent RNA replication events. 

3. The nucleic acid molecule of claim 1, wherein said RNA-dependent 
RNA polymerase is selected from the group consisting of: (a) a 
temperature-sensitive RNA-dependent RNA polymerase; (b) a non- 
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cytopathic RNA-dependent RNA polymerase; and (c) a temperature- 
sensitive, non-cytopathic RNA-dependent RNA polymerase. 

The nucleic acid molecule of claim 1, wherein said origin of 
replication is derived from a prokaryotic organism or a eukaryotic 
organism. 

The nucleic acid molecule of claim 1, wherein said origin of 
replication is derived from a virus. 

The nucleic acid molecule of claim 1, wherein said origin of 
replication is derived from a DNA virus. 

The nucleic acid molecule of claim 1, wherein said origin of 
replication is derived from a DNA virus that allows, for episomal 
replication. 

The nucleic acid molecule of claim 1, wherein said origin of 
replication is derived from a DNA virus selected from the group 
consisting of Papillomavirus, Polyomavirus, Adenovirus, and 
Hepadnaviruses. 

The nucleic acid molecule of claim 1, wherein said origin of 
replication is derived from a Herpesvirus. 

The nucleic acid molecule of claim 1, wherein said origin of 
"replication is derived from Epstein-Barr virus (EBV). 

The nucleic acid molecule of claim 10, wherein said origin of 
replication is or/P. 
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The nucleic acid molecule of claim 1, wherein said origin of 
replication is derived from an eukaryotic organism selected from the 
group consisting of yeast, and insects. 

The nucleic acid molecule of claim 1, wherein said origin of 
replication is derived from a mammal. 

The nucleic acid molecule of claim 1, wherein said replication 
initiation factor is capable of operating as a plasmid maintenance 
factor. 

The nucleic acid molecule of claim 1, wherein said replication 
initiation factor is derived from a prokaryotic organism or a eukaryotic 
organism. 

The nucleic acid molecule of claim 1, wherein said replication 
initiation factor is derived from a virus. 

The nucleic acid molecule of claim I, wherein said replication 
initiation factor is derived from a DNA virus. 

The nucleic acid molecule of claim 1, wherein said replication 
initiation factor is derived from a DNA virus selected from the group 
consisting of Herpesvirus, Papillomavirus, Polyomavirus, Adenovirus, 
and Hepadnavirus. 

The nucleic acid molecule of claim 1, wherein said replication 
initiation factor is derived from a Herpesvirus. 

The nucleic acid molecule of claim 1, wherein said replication 
initiation factor is derived from EBV. 
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21. The nucleic acid molecule of claim 20, wherein said replication 
initiation factor is the EBNA-1 protein. 

22. The nucleic acid molecule of. claim 1, wherein said replication 
initiation factor and said origin of replication are derived from the 
same organism or the same virus. 

23. The nucleic acid molecule of claim 1, wherein said replication 
initiation factor and said origin of replication are derived from different 
organisms or viruses. 

24. The nucleic acid molecule of claim 1 further comprising a fourth 
polynucleotide element, wherein said fourth polynucleotide element 
comprises a selection marker. 

25. The nucleic acid molecule of claim 24, wherein said selection marker 
confers resistance to Puromycin. 

26. The nucleic acid molecule of claim 1, wherein said RNA-dependent 
RNA polymerase is of viral origin. 

27. The nucleic acid molecule of claim 1, wherein said RNA-dependent 
RNA polymerase is of alphaviral origin. 

28. The nucleic acid molecule of claim 1, wherein said RNA-dependent 
RNA polymerase is derived from a Sindbis virus. 

29. The nucleic acid molecule of claim 1, wherein said RNA-dependent 
RNA polymerase is derived from a Semliki Forest virus. 

30. The nucleic acid molecule of claim 1, wherein said RNA-dependent 
RNA polymerase is derived from an Aura virus. 
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31. The nucleic acid molecule of claim 1, wherein said RNA-dependent 
RNA polymerase is derived from a virus selected from the group 
consisting of Bebaru virus, Cabassou virus, Chikungunya virus, Easter 
equine encephalomyelitis virus, Fort morgan virus, Getah virus, 
Kyzylagach virus, Mayoaro virus, Middleburg virus, Mucambo virus, 
Ndumu virus, Pixuna virus, Tonate virus, Triniti virus, Una virus, 
Western equine encephalomyelitis virus, Whataroa virus, Venezuelan 
equine encephalomyelitis virus (VEE), and Ross River virus. 

32. The nucleic acid molecule of claim 3, wherein said temperature- 
sensitive RNA-dependent RNA polymerase has replicase activity at 
temperatures below 34°C which is at least five fold greater than the 
replicase activity exhibited at 34°C or above. 

33. The nucleic acid molecule of claim 3, wherein said temperature- 
sensitive RNA-dependent RNA polymerase has replicase activity at 
34°C which is at least five fold lower than the replicase activity 
exhibited at 29°C. 

34. The nucleic acid molecule of claim 1, wherein said second open 
reading frame encodes a cytokine, a lymphokine, a tumor necrosis 
factor, an interferon, a toxic polypeptide, a prodrug or a converting 
enzyme. 

35. The nucleic acid molecule of claim 1, wherein said second open 
reading frame encodes human p-interferon. 

36. The nucleic acid molecule of claim 1, wherein said second nucleotide 
sequence encodes an untranslated RNA molecule selected from the 
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group consisting of an antisense RNA molecule, a tRNA molecule, a 
rRNA molecule, and a ribozyme. 

An expression system comprising: 

(A) a first polynucleotide element which encodes an RNA 
molecule, said RNA molecule comprising: 

(a) at least one cw-acting sequence element; 

(b) a first nucleotide sequence comprising a first open 
reading frame, said first open reading frame having a 
nucleotide sequence encoding an RNA-dependent RNA 
polymerase; and 

(c) at least one second nucleotide sequence selected from 
the group consisting of: 

(i) a second open reading frame encoding a 
polypeptide; 

(ii) a nucleotide sequence complementary to all or a 
part of the second open reading frame of (i); and 

(iii) a nucleotide sequence encoding an untranslated . 
RNA molecule or complement thereof; 

wherein said second nucleotide sequence is operably 
linked to a promoter which is recognized by said RNA- 
dependent RNA polymerase; 

(B) a second polynucleotide element comprising an origin of 
replication; and 

(C) a third polynucleotide element encoding a replication initiation 
factor capable of recognizing said origin of replication. 

The expression system of claim 37, wherein said second open reading 
frame is in a translatable format after one or more RNA-dependent 
RNA replication events. 
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39. The expression system of claim 37, wherein said first, second and third 
polynucleotide elements are DNA elements. 

40. The expression system of claim 37, wherein said first, second and third 
polynucleotide elements are each present on a separate nucleic acid 
molecule. 

41. The expression system of claim 37, wherein said first, second and third 
polynucleotide elements are all present on a single nucleic acid 
molecule. 

42. The expression system of claim 37, wherein said first and said second 
polynucleotide elements are present on a first nucleic acid molecule, 
and said third polynucleotide element is present on a second nucleic 
acid molecule. 

43. The expression system of claim 42, wherein said second nucleic acid 
molecule further comprises a fourth polynucleotide element capable of 
promoting the replication of said second nucleic acid molecule. 

44. The expression system of claim 42, wherein said second nucleic acid 
sequence further comprises an origin of replication. 

45. The expression system of claim 37, wherein said RNA-dependent 
RNA polymerase is selected from the group consisting of: (a) a 
temperature-sensitive RNA-dependent RNA polymerase; (b) a non- 
cytopathic RNA-dependent RNA polymerase; and (c) a temperature- 
sensitive, non-cytopathic RNA-dependent RNA polymerase. 

46. The expression system of claim 37, wherein said origin of replication 
is derived from a prokaryotic organism or a eukaryotic organism. 
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47. The expression system of claim 37, wherein said origin of replication 
is derived from a virus, 

48. The expression system of claim 37, wherein said origin of replication 
is derived from a DNA virus. x 

49. The expression system of claim 37, wherein said origin of replication 
is derived from a DNA virus that allows for episomal replication. 

50. The expression system of claim 37, wherein said origin of replication 
is derived from a DNA virus selected from the group consisting of 
Herpesvirus, Papillomavirus, Polyomavirus, Adenovirus, and 
Hepadnaviruses. 

51. The expression system of claim 37, wherein said origin of replication 
is derived from a Herpesvirus. 

52. The expression system of claim 37, wherein said origin of replication 
is derived from Epstein-Barr virus (EBV). 

53. The expression system of claim 52, wherein said origin of replication 
is oriP. 

54. The expression system of claim 37, wherein said origin of replication 
is derived from an eukaryotic organism selected from the group 
consisting of yeast, insects and mammals. 

55. The expression system of claim 37, wherein said origin of replication 
is derived from a mammal. 



The expression system of claim 37, wherein said replication initiation 
factor is capable of operating as plasmid maintenance factor. 
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57. The expression system of claim 37, wherein said replication initiation 
factor is derived from a prokaryotic organism or a eukaryotic 
organism. 

58. The expression system of claim 37, wherein said replication initiation 
factor is derived from a virus. 

59. The expression system of claim 37, wherein said replication initiation 
factor is derived from a DNA virus. 

60. The expression system of claim 37, wherein said replication initiation 
factor is derived from a DNA virus selected from the group consisting 
of Herpesvirus, Papillomavirus, Polyomavirus, Adenovirus, and 
Hepadnavirus. 

61. The expression system of claim 37, wherein said replication initiation 
factor is derived from a Herpesvirus. 

62. The expression system of claim 37, wherein said replication initiation 
factor is derived from EBV. 

63. The expression system of claim 37, wherein said replication initiation 
factor is the EBNA-1 protein, or portion thereof. 

64. The expression system of claim 37, wherein said replication initiation 
factor and said origin of replication are derived from the same 
organism or the same virus. 

65. The expression system of claim 37, wherein said replication initiation 
factor and said origin of replication are derived from different 
organisms. 
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66. The expression system of claim 37, further comprising a fourth 
polynucleotide element, wherein said fourth polynucleotide element 
comprises a selection marker. 

67. The expression system of claim 66, wherein said selection marker 
confers resistance to Puromycin. 

68. The expression system of claim 37, wherein said RNA-dependent 
RNA polymerase is of viral origin. 

69. The expression system of claim 37, wherein said RNA-dependent 
RNA polymerase is of alphaviral origin. 

70. The expression system of claim 37, wherein said RNA-dependent 
RNA polymerase is derived from a Sindbis virus. 

71. The expression system of claim 37, wherein said RNA-dependent 
RNA polymerase is derived from a Semliki Forest virus. 

72. The expression system of claim 37, wherein said RNA-dependent 
RNA polymerase is derived from an Aura virus. 

73. The expression system of claim 37, wherein said RNA-dependent 
RNA polymerase is derived from a virus selected from the group 
consisting of Bebaru virus, Cabassou virus, Chikungunya virus, Easter 
equine encephalomyelitis virus, Fort morgan virus, Getah virus, 
Kyzylagach virus, Mayoaro virus, Middleburg virus, Mucambo virus, 
Ndumu virus, Pixuna virus, Tonate virus, Triniti virus, Una virus, 
Western equine encephalomyelitis virus, Whataroa virus, Venezuelan 
equine encephalomyelitis virus (VEE), and Ross River virus. 
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74. The expression system of claim 45, wherein said temperature-sensitive 
RNA-dependent RNA polymerase has replicase activity at 
temperatures below 34°C which is at least five fold greater than the 
replicase activity exhibited at 34°C or above. 

75. The expression system of claim 45, wherein said temperature-sensitive 
RNA-dependent RNA polymerase has replicase activity at 34°C which 
is at least five fold lower than the replicase activity exhibited at 29°C. 

76. The expression system of claim 37, wherein said second open reading 
frame encodes a cytokine, a lymphokine, a tumor necrosis factor, an 
interferon, a toxic protein, a prodrug or a converting enzyme. 

77. The expression system of claim 37, wherein said second open reading 
frame encodes human p-interferon. 

78. The expression system of claim 37, wherein said second nucleotide 
sequence encodes an untranslated RNA molecule selected from the 
group consisting of an antisense RNA molecule, a tRNA molecule, a 
rRNA molecule, and a ribozyme. 

79. A method of making a recombinant host cell comprising introducing 
the nucleic acid molecule of claim 1 into a host cell. 

80. The method of claim 79, wherein said RNA-dependent RNA 
polymerase is selected from the group consisting of: (a) a temperature- 
sensitive RNA-dependent RNA polymerase; (b) a non-cytopathic 
RNA-dependent RNA polymerase; and (c) a temperature-sensitive, 
non-cytopathic RNA-dependent RNA polymerase. 



-69- 

81. The method of claim 79, wherein said nucleic acid molecule is 
introduced by way of transfection. 

82. A method of making a recombinant host cell comprising introducing 
the expression system of claim 37 into a host cell. 

83. The method of claim 82, wherein said RNA-dependent RNA 
polymerase is selected from the group consisting of: (a) a temperature- 
sensitive RNA-dependent RNA polymerase; (b) a . non-cytopathic 
RNA-dependent RNA polymerase; and (c) a temperature-sensitive, 
non-cytopathic RNA-dependent RNA polymerase. 

84. The method of claim 82, wherein said expression system is introduced 
by way of transfection. 

85. A recombinant host cell produced by the method of claim 79. 

86. A recombinant host cell produced by the method of claim 82. 

87. A recombinant host cell comprising: 

(A) a first polynucleotide element which encodes an RNA 
molecule, said RNA molecule comprising: 

(a) at least one cis-acting sequence element; 

(b) a first nucleotide sequence comprising a first open 
reading frame, said first open reading frame having a 
nucleotide sequence encoding an RNA-dependent RNA 
polymerase; and 

(c) at least one second nucleotide sequence selected from 
the group consisting of: 

(i) a second open reading frame encoding a 
polypeptide; 
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(ii) a nucleotide sequence complementary to all or a 
part of the open reading frame of (i); and 

(iii) a nucleotide sequence encoding an untranslated 
RNA molecule or complement thereof; 

wherein said second nucleotide sequence is operably 
linked to a promoter which is recognized by said RNA- 
dependent RNA polymerase; 

(B) a second polynucleotide element comprising an origin of 
replication; and 

(C) a third polynucleotide element encoding a replication initiation 
factor capable of recognizing said origin of replication. . < 

88. The recombinant host cell of claim 87, wherein said second open 
reading frame is in a translatable format after one or more RNA- 
dependent RNA replication events. 

89. The recombinant host cell of claim 87, wherein said first, second and 
third polynucleotide elements are DNA elements. 

90. The recombinant host cell of claim 87, wherein said third 
polynucleotide is stably integrated into the genome of said host cell. 

9L The recombinant host cell of claim 87, wherein said first and said 
second polynucleotide elements are present on a first nucleic acid 
molecule, and said third polynucleotide element is present on a second 
nucleic acid molecule. 

92. The recombinant host cell of claim 87, further comprising a fourth 
polynucleotide comprising a selection marker. 

M. The recombinant host cell of claim 92, wherein said selection marker 
confers resistance to Puromycin. 
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94. The recombinant host cell of claim 91, wherein said second nucleic 
acid molecule further comprises a fifth polynucleotide element capable 
of promoting the replication of said second nucleic acid molecule. 

95. The recombinant host cell of claim 91, wherein said nucleotide 
sequence further comprises an origin of replication. 

96. The recombinant host cell of claim 87, wherein said first, second and^ 
third polynucleotide elements are provided on the same nucleic acid 
molecule. 

97. The recombinant host cell of claim 87, wherein said RNA-dependent 
RNA polymerase is setected from the group consisting of: (a) a 
temperature-sensitive RNA-dependent RNA polymerase; (b) a non- 
cytopathic RNA-dependent RNA polymerase; and (c) a temperature- 
sensitive, non-cytopathic RNA-dependent RNA polymerase. 

98. The recombinant host cell of claim 87, wherein said recombinant host 
cell is a mammalian cell. 

99. The recombinant host cell of claim 98, wherein said mammalian cell is 
selected from the group consisting of a human cell, a primate cell and a 
rodent cell. 

100. The recombinant host cell of claim 87, wherein said host cell 
comprises more than one copy of said third polynucleotide. 

101. The recombinant host cell of claim 100, wherein at least one copy of 
said third polynucleotide is stably integrated into the genome of said 
host cell. 
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102. A method for producing a polypeptide or untranslated RNA molecule, 
said method comprising: 

(a) introducing the nucleic acid molecule of claim 1 into a host cell 
to produce a recombinant host cell; and 

(b) culturing said recombinant host cell under conditions suitable 
for expression of said polypeptide or untranslated RNA 
molecule. 

103. The method of claim 102, further comprising recovering said 
polypeptide or untranslated RNA molecule. 

104. A method for producing a polypeptide or untranslated RNA molecule, 
said method comprising: 

(a) introducing the expression system of claim 37 into a host cell to 
produce a recombinant host cell; and 

(b) culturing said recombinant host cell under conditions suitable 
for expression of said polypeptide or untranslated RNA 
molecule. 

105. The method of claim 104, further comprising recovering said 
polypeptide or untranslated RNA molecule. 

106. A method for regulating the expression of a polypeptide or an 
untranslated RNA molecule, said method comprising: 

(a) introducing the nucleic acid molecule of claim 3 into a host cell 
to produce a recombinant host cell; 

(b) growing said recombinant host cell under suitable culture 
conditions; and 

(c) changing the temperature of the recombinant host cell culture 
from: 

(i) a permissive temperature to a restrictive temperature, or 
. (ii) a restrictive temperature to a permissive temperature; 
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wherein said polypeptide or untranslated RNA molecule is encoded by 
said first polynucleotide. 

107. The method of claim 106, wherein said polypeptide is a polypeptide 
that is toxic to said host cell. 

108. A method for regulating the expression of a polypeptide or an 
untranslated RNA molecule, said method comprising: 

(a) introducing the expression system of claim 45 into a host cell to 
produce a recombinant host cell; 

(b) growing said recombinant host cell under suitable culture 
conditions; and 

(c) changing the temperature of the recombinant host cell culture 
from: 

(i) a permissive temperature to a restrictive temperature, or - 

(ii) a restrictive temperature to a permissive temperature; 
wherein said polypeptide or untranslated RNA molecule is encoded by 

said first polynucleotide. 

109. The method of claim 108, wherein said polypeptide is a polypeptide 
that is toxic to said host ceil. 
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GENE EXPRESSION SYSTEM 

ABSTRACT OF THE DISCLOSURE 

The present invention relates to compositions and methods that allow 
the production of polypeptides and untranslated RNA molecules in host cells. 
More specifically, the invention provides nucleic acid molecules, expression 
systems, recombinant host cells, methods and kits, which enable the 
production and/or isolation and/or purification of polypeptides and 
untranslated RNA molecules. The compositions and methods of the invention 
can be applied in a regulatable expression system that is transiently transfected 
into mammalian host cells. 

47068.1 
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CTGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCC 
CTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAA - CGGGG 
GCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACSTAGTG 
GGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACT 
GGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAA 
TGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACARAATATTAACGCTTACAATTTCCATTCGCCATTCAGGCTGCG 
CAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGAT 
TAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAftAACGACGGCCAGTGAgcgcgcaattaaccctcactaa 
agggaacaaaagctggctagtgGATCCAGTCTTATGCAATACTCTTGTAGTCTTGCAACATGGTAACGATGAGTTAGCAA 
CATGCCTTACAAGGAGAGAAAAAGCACCGTGCyVTGCCGATTGGTGGAAGTAAGGTGGTACGATCGTGCCTTATTAGGAAG 
GCAACAGACGGGTCTGACATGGATTGGACGAACCACTGAATTOCGCATTGCAGAGATATTGTATTTAAGTGCCCTACCTc 
gataccgTCGAGATTGACGGCGTAGTACACACTATTGAATCAAACAGCCGACCAATTGCACTACCATCACAATGGAGAAG 
CCAGTAGTAAACGTAGACGTAGACCCCCAGAGTCCGTTTGTCGTGCAACTGCAAAAAAGCTTCCCGCAATTTGAGGTAGT 

AGCACAGCAGGTCACTCCAAATGACX^TGCTAATGGCAGAGCATTTTCGCATCTGG^ 

TTCCTACCACAGCGACGATCTTGGACATAGGCAGCGCACOGGCTCGTAGAATGTTTTCCGAGCACCAGTATCATTGTGTC 
TGCCCCATGCGTAGTCCAGAAGACCCGGACCGCATGATGAAATACGCCAGTAAACTGGCGGAAAAAGCGTGCAAGATTAC 
AAACAAGAACTTGCATGAGAAGATTAAGGATCTCCGGACCGTACTTGATACGCCGGATGCTGAAACACCATCGCTCTGCT 

ttcacaacgatgttacctgcaacatgcgtgccgaata^ 

tatcatcaggctatgaaaggcgtgcggaccctgtactggattggcttcgacaccacccagttcatgttctcggctatggc 
aggttcgtaccctgcgtac^caccaactgggcxjgacgagam^^ 

AGCTGAGTGAAGGTAGGACAGGAAAATTGTCGATAATGAGGAAGAAGGAGTTGAAGCCCGGGTCGCGGGTTTATTTCTCC 
GTAGGATCGACACTTTATCCAGAACACAGAGCCAGCTT^ 

GCAGTCGTACACTTGCCGCTGTGATACAGTGGTGAGTTGCGAAGGCTACGTAGTGAAGAAAATCACCATCAGTCCCGGGA 

TCACGGGAGAAACOGTGGGATACGCGGTTACACACAATAGCGAGGGCTTCTTGCTATGCAAAGTTACTGACACAGTAAAA 

GGAGAACGGGTATCGTTCCCTGTGTGCACGTACATCCCGGCX^CCATATGCGATCAGATGACTGGTATAATGGCCACGGA 

TATATCACCTGACGATGCACAAAAACTTCTGGTTGGGCTCAACCAGCGAATTGTCATTAACGGTAGGACTAACAGGAACA 

CCAACACCATGCAAAATTACCTTCTGCCGATCATAGCACAAGGGTTCAGCAAATGGGCTAAGGAGCGCAAGGATGATCTT 

GATAACGAGAAAATGCTGGGTACTAGAGAACGCAAGCTTACGTATGGCTGCTTGTGGGCGTTTCGCACTAAGAAAGTACA 

TTCGTTTTATCGCCCACCTGGAACGCAGACCTGC6TAAAAGTCCCAGCCTCTTTTAGCGCTTTTCCCATGTCGTCCGTAT 

GGACGACCTCTTTGCCCATGTCGCTGAGGCAGAAATTGAAACTGGCATTGCAACCAAAGAAGGAGGAAAAACTGCTGCAG 

GTCTCGGAGGAATTAGTCATGGAGGCCAAGGCTGCTTTTGAGGATGCTCAGGAGGAAGCCAGAGCGGAGAAGCTCCGAGA 

AGCACTTCCACCATTAGTGGCAGACAAAGGCATCGAGGCAGCCGCAGAAGTTGTCTGCGAAGTGGAGGGGCTCCAGGCGG 

AC^TCGGAGCAGCATTAGTTGAAACCCCGCGCGGl^C^TAAGGATAATACCTCAAGCAAATGACCGTATGATCGGACAG 

TATATCGTTGTCTCGCCAAACTCTGTGCTGAAGAATGCCAAACTCGCACCAGCGCACCCGCTAGCAGATCAGGTTAAGAT 

CATAACACACTCCGGAAGATCAGGAAGGTACGCGGTCGAACCATACGACGCTAAAGTACTGATGCCAGCAGGAGGTGCCG 

TACCATGGCCAGAATTCCTAGCACTGAGTGAGAGCGCCACGTTAGTGTACAACGAAAGAGAGTTTGTGAACCGCAAACTA 

TACCACATTGCGATGCATGGCCCCGCCAAGAATACAGAAGAGGRGCACTACAAGGTTACAAAGGCAGAGCTTGCAGAAAC 

AGAGTACGTGTTTGACGTGGACAAGAAGCGTTGCGTTAAGAAGGAAGAAGCCTCAGGTCTGGTCCTCTCGGGAGAACTGA 

CCAACCCTCCCTATCATGAGCTAGCTCTGGAGGGACTGAAGACCCGACCT 

GTGATAGGC^CACCGGGGTCGGGCAAGTCAGCTATTATCAAGTCAACTGTCACGGCACGAGATCTTGTTACCAGCGGAAA 
GAAAGAAAATTGTCGCGAAATTGAGGCCGACGTGCTAAGACTGAGGGGTATGCAGATTACGTCGAAGACAGTAGATTCGG 
TTATGCTCAACGGATGCCACAAAGCCGTAGAAGTGCTGTACGTTGACGAAGCGTTCGCGTGCCACGCAGGAGCACTACTT 
GCCTTGATTGCTATCGTCAGGCCCCGCAAGAAGGTAGTACTATGCGGAGACCCCATGCAATGCGGATTCTTCAACATGAT 
GCAACTAAAGGTACATTTCAATCACCCTGAAAAAGACATATGCACCAAGACATTCTACAAGTATATCTCCCGGCGTTGCA 



CACAGCCAGTTACAGCTATTGTATCGACACTGCATTACGATGGAAAGATGAAAACCACGAACCCGTGCAAGAAGAACATT 
GAAATCGATATTACAGGGGCCACAAAGCCGAAGCCAGGGGATATCATCCTGACATGTTTCCGCGGGTGGGTTAAGCAATT 
GCAAATCGACTATCCCGGACATGAAGTAATGACAGCCGCGGCCTCACAAGGGCTAACCAGAAAAGGAGTGTATGCCGTCC 
GGCAAAAAGTCAATGAAAACCCACTGTACGCGATCACATCAGAGCATGTGAACGTC 

CTAGTGTGGAAAACCTTGCAGGGCGACCCATGGATTAAGCAGCCCACTAACATACCTAAAGGAAACTTTCAGGCTACTAT 

AGAGGACTGGGAAGCTGAACACAAGGGAATAATTGCTGCAATAAACAGCCCCACTCCCCGTGCCAATCCGTTCAGCTGCA 

AGACCAACGTTTGCTGGGCGA7UVGCATTGGAACCGATACTAGCCACGGCCGGTATCGTACTTACCGGTTGCCAGTGGA 

GAACTGTTCCCACAGTTTGCGGATGACAAACCACATTCGGCCATTTACGCCTTAGACGTAATTTGCATTAAGTTTTTCGG 

CATGGACTTGACAAGCGGACTGTTTTCTAAACAGAGCATCXX^ 

CTCATTGGGACAACAGCCCAGGAACCCGCAAGTATGGGTACGATC^CGCCATTGCCGCCGAACTCTCCCGTAGATTTCX^G 

GTGTTCCAGCTAGCTGGGAAGGGCACACAACTTGATTTGCAGACX3GGGAGAACCAGAGTTATCTCTGCA 

GGTCCCGGTGAACCGCAATCTOCCTC^CGCCTTAGTCCCCGAGTACAA 

TGAACCAGTTCAAACACCACTCAGTACTTGTGGTATCAGAGGAAAAAATTGAAGCTCCCCGTAAGAGAATC^ 

GCCXXX3ATTGGCATAGCCGGTGCAGATAAGAACTACAACCTGGCT 

GTTCATCAACATTGGAACTAAATAC^GAAACCACCACTTTCAGC^^ 

CGCGTTCGGCCCTgaattgTTtAaacTcaggaggcacCCTCGTGGTGAAGTCCTATGGCTACGCCGACCGCAACAGTGAG 
GACGTAGTCACCGCTCTTGCCAGAAAGTTTGTCAGGGTGTCTGCAGCGAGACCAGATTGTGTCTCAAGCAATACAGAAAT 
GTACCTGATTTTCCGACAACTAGACAACAGCCGTACACGGCAATTCACCCCGCACCATCTGAATTGCGTGATTTCGTCCG 
TGTATGAGGGTACAAGAGATGGAGTTGGAGCCGCGCCGTCATACCGCACCAAAAGGGAGAATATTGCTGACTGTCAAGAG 
•GAAGCAGTTGTCAACGCAGCCAATCCGCTGGGTAGACCAGGCGAAGGAGTCTGCCGTGCCATCTATAAACGTTGGCCGAC 
CAGTTTTACCGATTCAGCCACGGAGACAGGCACCGCAAGAATGACTGTGTGCCTAGGAAAGAAAGTGATCCACGCGGTCG 
GCCCTGATTTCCGGAAGCACCCAGAAGCAGAAGCCTTGAAATTGCTACAAAACGCCTACCATGCAGTGGCAGACTTAGTA 
AATGAACATAACATCAAGTCTGTCGCC^TTCCACTGCTAT^ 

ATCACTTAACTGCTTGACAACCGCGCTAGACAGAACTGACGCGGACGTAACCATCTATTGCCTGGATAAGAAGTGGAAGG 

AAAGAATCGACGCGGC^CTCCAACTTAAGGAGTCTGTAAC^GAGCTGAAGGATGAAGATATGGAGATCGACGATGAGTTA 

GTATGGATtCATCCAGACAGTTGCTTGAAGGGAAGTU^AGGGATTCAGTACTACAAAAGGAAAATTGTATTCGTACTTCGA 

AGGCACCAAATTCCATCAAGCAGCAAAAGACATGGCGGAGATAAAGGTCCTGTTCCCTAATGACCAC^SAAAGTAATGAAC 

AACTGTGTGCCTACATATTGGGTGAGACCATGGAAGCAATCCGCGAAAAGTGCCCGGTCGACCATAACCCGTCGTCTAGC 

CCGCCCAAAACGTTGCCGTGCCTTTGCATGTATGO^^ 

AGAAGTTACAGTATGCTCCTCCACCCCCCT^ 

TCCTGTTTAATCCGCACACTCCCGC^TTCGTTCC 

GCACAGGCCGAGGAGGCCCCCGAAGTTGTAGCGACACCGTC^C^ 

CATCTCACTGGATATGGATGACAGTAGCGAAGGCTCACTTTTTTCGAGCTTTAGCGGATCGGACAACTCTATTACTAGTA 
TGGACAGTTGGTCGTC^GGACCTAGTTGACTAGAGRTAG 

CAAGAGCCTGCCCCTATTCC^CCGCCAAGGCTAAAGAAGATGGCCCGCCTGGCAGCG 

GGCAAGCAATAGCTCTGAGTCCCTCCACCTCTCTTTTGGTGGGGTATCCATGTCCCTCGGATCAATTTTCGACGGAGAGA 
CGGCCCGCCAGGC^GTOGTACAACCCCTGGCAAC^GGCCCXIACGGATGTGCCTATGTCT 

GAGATTGATGAGCTGAGCCGCAGAGTAACTGAGTCCGAACCC^TCCTGTTTGGATCATTTGAACCGGGCGAAGTGAACTC 
AATTATATCGTCCCGATCAGCCGTATCTTTTCXSACT^ 

gactaaccggggtaggtgggtacatattttcgacggaca^ 

cagcttacagaaccgaccttggagcgcaatgtcctggaaagaattcatgccccggtgc^ 

actovaactcaggtaccagatgatgcc^ 

ccataaccactgagcgactactgtcaggactacgactgtataactctgccacagatcagccagaatgctataagatcacc 
tatccgaaaccattgtactccagtagcgt^^ 



TCTGCATGAGAACTATCCGACAGTAGCATCTTATCAGATTACTGACGAGTACGATGCTTACTTGGATATGGTAGACGaGA 
CAGTCGCaTGCCTGGATACTGCAACCTTCTGCCCCGCTAAGCTTAGAAGTTACCCGAAAAAACATGAGTATAGAGCCCCG 
AATATCCGCAGTGCGGTTCCATCAGCGATGCAGAACACGCTACAAA 

CGTCACGCAGATGCGTGAACTGCCAACACTGGACTCAGCGACATTCAATGTCGAATGCTTTCGAAAATATGCATGTAATG 
ACGAGTATTGGGAGGAGTTCGCTCGGAAGCCAATTAGGATTACCACTGAGTTTGTCACCGCATATGTAGCTAGACTGAAA 
GGCCCTAAGGCCGCCGCACTATTTGCAAAGACGTATAATTTGGTCCCATTGCAAGAAGTGCCTATGGATAGATTCGTCAT 
GGACATGAAAAGAGACGTGAAAGTTACACCAGGCACX3AAACACACAGAAGAAAGACCGAAAGTACAAGTGATACAAGCCG 
CAGAACCCCTGGCGACTGCTTACTTATGCGGGA 

ATTCACACGCTTTTTGACATGTCGGCGGAGGATTTTGATGCAATCATAGCAGAACACTTCAAGCAAGGCGACCCGGTACT 
GGAGACGGATATCGCATCATTCGACAAAAGCCAAGACGACGCTATGGCGTTAACCGGTCTGATGATCTTGGAGGACCTGG 
GTGTGGATCAACCACTACTCGACTTGATCGAGTGCGCCTTTGGAGAAATATCATCCACCCATCTACCTACGGGTACTCGT 
TTTAAATTCGGGGCGATGATGAAATCCGGAATGTTCCTCAC^CTTTTTC 

CAGAGTACTAGAAGAGCGGCXTAAAACGTCCAGATGTGCAGCGTTCATTGGCGACGACAACATCATACATGGAGTAGTAT 
CTGACAAAGAAATGGCTGAGAGGTGCGCCACCTGGCTCAACATGGAGGTTAAGATCATCGACGCAGTCATCGGTGAGAGA 
CCACCTTACTTCTGCGGCGGATTTATCTTGCAAGATTCGGTTACT 

GCTGTTTAAGTTGGGTAAACCGCTCCCAGCCGACGACGAGCAAGACGAAGACAGAAGACGCGCTCTGCTAGATGAAACAA 
AGGCGTGGTTTAGAGTAGGTATAACAGGCACTTTAGCAGTGGCCGTGACGACCCGGTATGAGGTAGACAATATTACACCT 
* GTCCTACTGGCATTGAGAACTTTTGCCXJAGAGCAAAAGAGCAT^ 

TGGTCCTAAATAGTCAGCATAGTACATTTCATCTGACTAATACTACAACACCACCACCtctagaCGCGTAGAtctcacgt 

gagcatgcaggccttgggCCCAATGATCOSACCAGCAAAACrCGATGTACTTCCGAGGAACTGATGTGCATAATG 

GGCTGGTACATTAGATCCCCGCTTACCGCGGGCAATATAGCAACACTAAAAACTCGATGTACTTCCGAGGAAGCGCAGTG 

CATAATGCTGCGCAGTGTTGCCACATAACCACTATATTAACCATTTATCTAGCGGACGCXAAAAACTCAATGTATTTCTG 

AGGAAGCGTGGTGCATAATGCCACGCAGCGTCTGCATAACTTTTATTATTTCTTTTATTAATCAACAAAATTTTGTTTTT 

AAC^TTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 

GTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCA^^ 

CTCATCAATGTATCTTATCATGTCTGGATCCGTCGAGACGCGTccaattcgccctatagtgagtcgtattacgcgcgcTT 
GGCGTAATCATGGTCATAGCTGTTTCCTGTGTG7\AATTGTTATCCGCTCACAATTCCACACAACAT?-.CGAGCCGGAAGCA 
TAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCG 
GGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGC 
TTCCTCGCTCACTGACTOSC^ 

TATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCA 
GCX3TTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCT 

CCOGAC^GGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTA 
CCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTG 
TAGGTOSTTCGCTCCAAGCTGGGCTGTGT^^ 

TCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCX3A 

TAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTG 

CTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAAC^ 

TGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTQUVG^ 

AGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAA 

AAATGAAGTT TT AAAT C AATCTAAAGTAT ATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACC 
TATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCT 
TACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCA 
GCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGGAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGG/^^ 




AGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTG 
GTATGGCTTCATTCA6CTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGC 
TCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTC 
TCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGC 



GGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACC 
CAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAA7VACAGGAAGGCAAAATGCCGCAAAAAAGG 
GAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGT 
CTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCC 
AC 



GGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATT 
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CTGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGA 
CCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCrrCCCTTCCTTTCTCG 

ccacgttcgccggcggatccggccattagccatattattcattggttatatagcataaat 

caatattggctattggccattgcatacgttgtatccatatcataatatgtacatttatat 

tggctcatgtccaacattaccgccatgttgacattgattattgactagttattaatagta 

atcaattacggggtcattagttcatagcccatatatggagttccgcgttacataacttac 

ggtaaatggcccgcctggctgaccgccx:aacgacccccgcccattgacgtcaataatgac 

gtatgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtattt 

acggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccctat 

tgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatggga 

ctttcctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggtt 

ttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctcca 

ccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatg 

TCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTA 

TATAAGCAGAGCTCGTTTAGTGAACCGTATTGACGGCGTAGTACACACTATTGAATCAAA 

CAGCCGACCAATTGCACTACCATCACAATGGAGAAGCCAGTAGTAAACGTAGACGTAGAC 

CCCCAGAGTCCGTTTGTCGTGCAACTGCAAAAAAGCTTCCCGCAATTTGAGGTAGTAGCA 

CAGCAGGTCACTCCAAATGACCATGCTAATGCCAGAGCATTTTCGCATCTGGCCAGTAAA 

CTAATCGAGCTGGAGGTTCCTACCACAGCGACGATCTTGGACATAGGCAGCGCACCGGCT 

cgtagaatgttttccgagcaccagtatcattgtgtctgccccatgcgtagtccagaagac 

ccggaccgcatgatgaaatatgccagtaaactggcggaaaaagcgtgcaagattacaaac 

aagaacttgcatgagaagattaaggatctccggaccgtacttgatacgccggatgctgaa 

acaccatcgctctgctttcacaacgatgttacctgcaacatgcgtgccgaatattccgtc 

atgcaggacgtgtatatcaacgctcccggaactatctatcatcaggctatgaaaggcgtg 

cggaccctgtactggattggcttcgacaccacccagttcatgttctcggctatggcaggt 

tcgtaccctgcgtacaacao:aactgggccgacgagaaagtccttgaagcgcgtaacatc 

ggactttgcagcacaaagctgagtgaaggtaggacaggaaaattgtcgataatgaggaag 

AAGGAGTTGAAGCCCGGGTCGCGGGTTTATTTCTCCGTAGGATCGACACTTTATCCAGAA 

CACAGAGCCAGCTTGCAGAGCTGGCATCTTCCATCGGTGTTCCACTTGAATGGAAAGCAG 

TCGTACACTTGCCGCTGTGATACAGTGGTGAGTTGCGAAGGCTACGTAGTGAAGAAAATC 

ACCATCAGTCCCGGGATCACGGGAGAAACCGTGGGATACGCGGTTACACACAATAGCGAG 

GGCTTCTTGCTATGCAAAGTTACTGACACAGTAAAAGGAGAACGGGTATCGTTCCCTGTG 

TGCACGTACATCCCGGCCACCATATGCG ATC AG ATG ACTG GTATA ATGGCC ACG G ATATA 

TCACCTGACGATGCACAAAAACTTCTGGTTGGGCTCAACCAGCGAATTGTCATTAACGGT 

AGGACTAACAGGAACACCAACACCATGCAAAATTACCTTCTGCCGATCATAGCACAAGGG 

TTCAGCAAATGGGCTAAGGAGCGCAAGGATGATCTTGATAACGAGAAAATGCTGGGTACT 

AGAG AACGCAAGCTTACGTATGGtTGCTTGTGGGCGTTTCGCACTAAGAAAGTACATTCG 

TTTTATCGCCCACCTGGAACGCAGACCTGCGTAAAAGTCCCAGCCTCTTTTA^ 

CCCATGTCGTCCGTATGGACGACCTCTTTGCCCATGTCGCTGAGGCAGAAATTGAAACTG 

GCATTGCAACCAAAGAAGGAGGAAAAACTGCTGCAGGTCTCGGAGGAATTAGTCATGGAG 

GCCAAGGCTGCTTTTGAGGATGCTCAGGAGGAAGCCAGAGCGGAGAAGCTCCGAGAAGCA 

CTTCCACCATTAGTGGCAGACAAAGGCATCGAGGCAGCCGCAGAAGTTGTCTGCGAAGTG 

GAGGGGCTCCAGGCGGACATCGGAGCAGCATTAGTTGAAACCCCGCGCGGTCACGTAAGG 

ATAATACCTCAAGCAAATGACCGTATGATCGGACAGTATATCGTTGTCTCGCCAAACTCT 

GTGCTGAAGAATGCCAAACTCGCACCAGCGCACCCGCTAGCAGATCAGGTTAAGATCATA 

ACACACTCCGGAAGATCAGGAAGGTACGCGGTCGAACCATACGACGCTAAAGTACTGATG 

CCAGCAGGAGGTGCCGTACCATGGCCAGAATTCCTAGCACTGAGTGAGAGCGCCACGTTA 

GTGTACAACGAAAGAGAGTTTGTGAACCGCAAACTATACCACATTGCCATGCATGGCCCC 

GCCAAGAATACAGAAGAGGAGCAGTACAAGGTTACAAAGGCAGAGCTTGCAGAAACAGAG 

TACGTGTTTGACGTGGACAAGAAGCGTTGCGTTAAGAAGGAAGAAGCCTCAGGTCTGGTC 

CTCTCGGGAGAACTGACCAACCCTCCCTATCATGAGCTAGCTCTGGAGGGACTGAAGACC 

CGACCTGCGGTCCCGTACAAGGTCGAAACAATAGGAGTGATAGGCACACCGGGGTCGGGC 

AAGTCAGCTATTATCAAGTCAACTGTCACGGCACGAGATCTTGTTACCAGCGGAAAGAAA 

GAAAATTGTCGCGAAATTGAGGCCGACGTGCTAAGACTGAGGGGTATGCAGATTACGTCG 

AAGACAGTAGATTCGGTTATGCTCAACGGATGCCACAAAGCCGTAGAAGTGCTGTACGTT 

GACGAAGCGTTCGCGTGCCACGCAGGAGCACTACTTGCCTTGATTGCTATCGTCAGGCCC 

CGCAAGAAGGTAGTACTATGCGGAGACCCCATGCAATGCGGATTCTTCAACATGATGCAA 

CTAAAGGTACATTTCAATCACCCTGAAAAAGACATATGCACCAAGACATTCTACAAGTAT 

ATCTCCCGGCGTTGCACACAGCCAGTTACAGCTATTGTATCGACACTGCATTACGATGGA 

AAGATGAAAACCACGAACCCGTGCAAGAAGAACATTGAAATCGATATTACAGGGGCCACA 
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AAGCCGAAGCCAGGGGATATCATCCTGACATGTTTCCGCGGGTGGGTTAAGCAATTGCAA 

ATCGACTATCCCGGACATGAAGTAATGACAGCCGCGGCCTCACAAGGGCTAACCAGAAAA 

GGAGTGTATGCCGTCCGGCAAAAAGTCAATGAAAACCCACTGTACGCGATCACATCAGAG 

CATGTGAACGTGTTGCTCACCCGCACTGAGGACAGGCTAGTGTGGAAAACCTTGCAGGGC 

GACCCATGGATTAAGCAGCTCACTAACATACCTAAAGGAAACTTTCAGGCTACTATAGAG 

GACTGGGAAGCTGAACACAAGGGAATAATTGCTGCAATAAACAGCCCCACTCCCCGTGCC 

AATCCGTTCAGCTGCAAGACCAACGTTTGCTGGGCGAAAGCATTGGAACCGATACTAGCC 

ACGGCCGGTATCGTACTTACCGGTTGCCAGTGGAGCGAACTGTTCCCACAGTTTGCGGAT 

GACAAACCACATTCGGCCATTTACGCCTTAGACGTAATTTGCATTAAGTTTTTCGGCATG 

GACTTGACAAGCGGACTGTTTTCTAAACAGAGCATCCCACTAACGTACCATCCCGCCGAT 

TCAGCGAGGCCGGTAGCTCATTGGGACAACAGCCCAGGAACCCGCAAGTATGGGTACGAT 

CACGCCATTGCCGCCGAACTCTCCCGTAGATTTCCGGTGTTCCAGCTAGCTGGGAAGGGC 

ACACAACTTGATTTGCAGACGGGGAGAACCAGAGTTATCTCTGCACAGCATAACCTGGTC 

CCGGTGAACCGCAATCTTCCTCACGCCTTAGTCCCCGAGTACAAGGAGAAGCAACCCGGC 

CCGGTCGAAAAATTCTTGAACCAGTTCAAACACCACTCAGTACTTGTGGTATCAGAGGAA 

AAAATTGAAGCTCCCCGTAAGAGAATCGAATGGATCGCCCCGATTGGCATAGCCGGTGCA 

GATAAGAACTACAACCTGGCTTTCGGGTTTCCGCCGCAGGCACGGTACGACCTGGTGTTC 

ATCAACATTGGAACTAAATACAGAAACCACCACTTTCAGCAGTGCGAAGACCATGCGGCG 

ACCTTAAAAACCCTTTCGCGTTCGGCCCTGAATTGTTTAAACTCAGGAGGCACCCTCGTG 

GTGAAGTCCTATGGCTACGCCGACCGCAACAGTGAGGACGTAGTCACCGCTCTTGCCAGA 

AAGTTTGTCAGGGTGTCTQCAGCGAGACCAGATTGTGTCTCAAGCAATACAGAAATGTAC 

CTGATTTTCCGACAACTAGACAACAGCCGTACACGGCAATTCACCCCGCACCATCTGAAT 

TGCGTGATTTCGTCCGTGTATGAGGGTACAAGAGATGGAGTTGGAGCCGCGCCGTCATAC 

CGCACCAAAAGGGAGAATATTGCTGACTGTCAAGAGGAAGCAGTTGTCAACGCAGCCAAT 

CCGCTGGGTAGACCAGGCGAAGGAGTCTGCCGTGCCATCTATAAACGTTGGCCGACCAGT 

TTTACCGATTCAGCCACGGAGACAGGCACCGCAAGAATGACTGTGTGCCTAGGAAAGAAA 

GTGATCCACGCGGTCGGCCCTGATTTCCGGAAGCACCCAGAAGCAGAAGCCTTGAAATTG 

CTACAAAACGCCTACCATGCAGTGGCAGACTTAGTAAATGAACATAACATCAAGTCTGTC 

GCCATTCCACTGCTATCTACAGGCATTTACGCAGCCGGAAAAGACCGCCTTGAAGTATCA 

CTTAACTGCTTGACAACCGCGCTAGACAGAACTGACGCGGACGTAACCATCTATTGCCTG 

GATAAGAAGTGGAAGGAAAGAATCGACGCGGCACTCCAACTTAAGGAGTCTGTAACAGAG 

CTGAAGGATGAAGATATGGAGATCGACGATGAGTTAGTATGGATTCATCCAGACAGTTGC 

TTGAAGGGAAGAAAGGGATTCAGTACTACAAAAGGAAAATTGTATTCGTACTTCGAAGGC 

ACCAAATTCCATCAAGCAGCAAAAGACATGGCGGAGATAAAGGTCCTGTTCCCTAATGAC 

CAGGAAAGTAATGAACAACTGTGTGCCTACATATTGGGTGAGACCATGGAAGCAATCCGC 

GAAAAGTGCCCGGTCGACCATAACCCGTCGTCTAGCCCGCCCAAAACGTTGCCGTGCCTT 

TGCATGTATGCCATGACGCCAGAAAGGGTCCACAGACTTAGAAGCAATAACGTCAAAGAA 

GTTACAGTATGCTCCTCCACCCCCCTTCCTAAGCACAAAATTAAGAATGTTCAGAAGGTT 

CAGTGCACGAAAGTAGTCCTGTTTAATCCGCACACTCCCGCATTCGTTCCCGCCCGTAAG 

TACATAGAAGTGCCAGAACAGCCTACCGCTCCTCCTGCACAGGCCGAGGAGGCCCCCGAA 

GTTGTAGCGACACCGTCACCATCTACAGCTGATAACACCTCGCTTGATGTCACAGACATC 

TCACTGGATATGGATGACAGTAGCGAAGGCTCACTTTTTTCGAGCTTTAGCGGATCGGAC 

AACTCTATTACTAGTATGGACAGTTGGTCGTCAGGACCTAGTTCACTAGAGATAGTAGAC 

CGAAGGCAGGTGGTGGTGGCTGACGTTCATGCCGTCCAAGAGCCTGCCCCTATTCCACCG 

CCAAGGCTAAAGAAGATGGCCCGCCTGGCAGCGGCAAGAAAAGAGCCCACTCCACCGGCA 

AGCAATAGCTCTGAGTCCCTCCACCTCTCTTTTGGTGGGGTATCCATGTCCCTCGGATCA 

ATTTTCGACGGAGAGACGGCCCGCCAGGCAGCGGTACAACCCCTGGCAACAGGCCCCACG 

GATGTGCCTATGTCTTTCGGATCGTTTTCCGACGGAGAGATTGATGAGCTGAGCCGCAGA 

GTAACTGAGTCCGAACCCGTCCTGTTTGGATCATTTGAACCGGGCGAAGTGAACTCAATT 

ATATCGTCCCGATCAGCCGTATCTTTTCCTCTACGCAAGCAGAGACGTAGACGCAGGAGC 

AGGAGGACTGAATACTGACTAACCGGGGTAGGTGGGTACATATTTTCGACGGACACAGGC 

CCTGGGCACTTGCAAAAGAAGTCCGTTCTGCAGAACCAGCTTACAGAACCGACCTTGGAG 

CGCAATGTCCTGGAAAGAATTCATGCCCCGGTGCTCGACACGTCGAAAGAGGAACAACTC 

AAACTCAGGTACCAGATGATGCCCACCGAAGCCAACAAAAGTAGGTACCAGTCTCGTAAA 

GTAGAAAATCAGAAAGCCATAACCACTGAGCGACTACTGTCAGGACTACGACTGTATAAC 

TCTGCCACAGATCAGCCAGAATGCTATAAGATCACCTATCCGAAACCATTGTACTCCAGT 

AGCGTACCGGCGAACTACTCCGATCCACAGTTCGCTGTAGCTGTCTGTAACAACTATCTG 

CATGAGAACTATCCGACAGTAGCATCTTATCAGATTACTGACGAGTACGATGCTTACTTG 

GATATGGTAGACGAGACAGTCGCATGCCTGGATACTGCAACCTTCTGCCCCGCTAAGC^ 

AGAAGTTACCCGAAAAAACATGAGTATAGAGCCCCGAATATCCGCAGTGCGGTTCCATCA 
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GCGATGCAGAACACGCTACAAAATGTGCTCATTGCCGCAACTAAAAGAAATTGCAACGTC 

ACGCAGATGCGTGAACTGCCAACACTGGACTCAGCGACATTCAATGTCGAATGCTTTCGA 

AAATATGCATGTAATGACGAGTATTGGGAGGAGTTCGCTCGGAAGCCAATTAGGATTACC 

ACTGAGTTTGTCACCGCATATGTAGCTAGACTGAAAGGCCCTAAGGCCGCCGCACTATTT 

GCAAAGACGTATAATTTGGTCCCATTGCAAGAAGTGCCTATGGATAGATTCGTCATGGAC 

ATGAAAAGAGACGTGAAAGTTACACCAGGCACGAAACACACAGAAGAAAGACCGAAAGTA 

CAAGTGATACAAGCCGCAGAACCCCTGGCGACTGCTTACTTATGCGGGATTCACCGGGAA 

TTAGTGCGTAGGCTTACGGCCGTCTTGCTTCCAAACATTCACACGCITTTTGACATGTCG 

GCGGAGGATTTTGATGCAATCATAGCAGAACACTTCAAGCAAGGCGACCCGGTACTGGAG 

ACGGATATCGCATCATTCGACAAAAGCCAAGACGACGCTATGGCGTTAACCGGTCTGATG 

ATCTTGGAGGACCTGGGTGTGGATCAACCACTACTCGACTTGATCGAGTGCGCCTTTGGA 

GAAATATCATCCACCCATCTACCTACGGGTACTCGTTTTAAATTCGGGGCGATGATGAAA 

TCCGGAATGTTCCTCACACTTTTTGTCAACACAGTTTTGAATGTCGTTATCGCCAGCAGA 

GTACTAGAAGAGCGGCTTAAAACGTCCAGATGTGCAGCGTTCATTGGCGACGACAACATC 

ATACATGGAGTAGTATCTGACAAAGAAATGGCTGAGAGGTGCGCCACCTGGCTCAACATG 

GAGGTTAAGATCATCGACGCAGTCATCGGTGAGAGACCACCTTACTTCTGCGGCGGATTT 

ATCTTGCAAGATTCXjGTTACTTCCACAGCGTGCCGCGTGGCGGATCCCCTGAAAAGGCTG 

TTTAAGTTGGGTAAACCGCTCCCAGCCGACGACGAGCAAGACGAAGACAGAAGACGCGCT 

CTGCTAGATGAAACAAAGGCGTGGTTTAGAGTAGGTATAACAGGCACTTTAGCAGTGGCC 

GTGAGGACCCGGTATGAGGTAGACAATATTACACCTGTCCTACTGGCATTGAGAACTTTT 

GCCCAGAGCAAAAGAGCATTCCAAGCCATCAGAGGGGAAATAAAGCATCTCTACGGTGGT 

CCTAAATAGTCAGCATAGTACATTTCATCTGACTAATACCACAACACCACCACCTCTAGA 

TTAATTAACTCGAGGCGCGCCGGGCCCAGAGGATCTTTGTGAAGGAACCTTACTTCTGTG 

GTGTGACATAATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGTAAATATAAAAT 

TTTTAAGTGTATAATGTGTTAAACTACTGATTCT 

CCTATGGAACTGATGAATGGGAGCAGTGGTGGAATGCCTTTAATGAGGAAAACCTGTTTT 

GCTCAGAAGAAATGCCATCTAGTGATGATGAGGCTACTGCTGACTCTCAACATTCTACTC 

CTCCAAAAAAGAAGAGAAAGGTAGAAGACCCCAAGGACTTTCCTTCAGAATTGCTAAGTT 

TTTTGAGTCATGCTGTGTTTAGTAATAGAACTCTTGCTTGCrTTGCTATTTACACCACAA 

AGGAAAAAGCTGCACTGCTATACAAGAAAATTATGGAAAAATATTTGATGTAGCGGCCCA 

ATGATCCGACCAGCAAAACTCGATGTACTTCCGAGGAACTGATGTGCATAATGCATCAGG 

CTGGTACATTAGATCCCCGCTTACCGCGGGCAATATAGCAACACTAAAAACTCGATGTAC 

TTCCGAGGAAGCGCAGTGCATAATGCTGCGCAGTGTTGCCACATAACCACTATATTAACC 

ATTTATCTAGCGGACGCCAAAAACTCAATGTATTTCTGAGGAAGCGTGGTGCATAATGCC 

ACGCAGCGTCTGCATAACTTTTATTATTTCTTTTATTAATCAACAAAAT^^ 

CATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGGAATTCCCAACTTGTTTAT 

TGCA GCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATT 

TTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTG 

GATCCGTCGAGACGCGTGCGGCCGCGGGAATTCGATCTTTATGTGTAACTCTTGGCTGAA 

GCTCTTACACCAATGCTGGGGGACATGTACCTCCCAGGGGCCCAGGAAGACTACGGGAGG 

CTACACCAACGTCAATCAGAGGGGCCTGTGTAGCTACCGATAAGCGGACCCTCAAGAGGG 

CATTAGCAATAGTGTTTATAAGGCCCCCTTGTTAACCCTAAACGGGTAGCATATGCTTCC 

CGGGTAGTAGTATATACTATCCAGACTAACCCTAATTCAATAGCATATGTTACCCAACGG 

GAAGCATATGCTATCGAATTAGGGTTAGTAAAAGGGTCCTAAGGAACAGCGATATCTCCC 

ACCCCATGAGCTGTCACGGTTTTATTTACATGGGGTCAGGATTCCACGAGGGTAGTGAAC 

CATTTTAGTCACAAGGGCAGTGGCTGAAGATCAAGGAGCGGGCAGTGAACTCTCCTGAAT 

CTTCGCCTGCTTCTTCATTCTCCTTCGTTTAGCTAATAGAATAACTGCTGAGTTGTGAAC 

AGTAAGGTGTATGTGAGGTGCTCGAAAACAAGGTTTCAGGTGACGCCCCCAGAATAAAAT 

TTGGACGGGGGGTTCAGTGGTGGCATTGTGCTATGACACCAATATAACCCTCACAAACCC 

CTTGGGCAATAAATACTAGTGTAGGAATGAAACATTCTGAATATCTTTAACAATAGAAAT 

CCATGGGGTGGGGACAAGCCGTAAAGACTGGATGTCCATCTCACACGAATTTATGGCTAT 

GGGCAACACATAATCCTAGTGCAATATGATACTGGGGTTATTAAGATGTGTCCCAGGCAG 

GGACCAAGACAGGTGAACCATGTTGTTACACTCTATTTGTAACAAGGGGAAAGAGAGTGG 

ACGCCGACAGCAGCGGACTCCACTGGTTGTCTCTAACACCCCCGAAAATTAAACGGGGCT 

CCACGCCAATGGGGCCCATAAACAAAGACAAGTGGCCACTCTTTTTTTTGAAATTGTGGA 

GTGGGGGCACGCGTCAGCCCCCACACGCCGCCCTGCGGTTTTGGACTGTAAAATAAGGGT 

GTAATAACTTGGCTGATTGTAACCCCGCTAACCACTGCGGTCAAACCACTTGCCCACAAA 

ACCACTAATGGCACCCCGGGGAATACCTGCATAAGTAGGTGGGCGGGCCAAGATAGGGGC 

GCGATTGCTGCGATCTGGAGGACAAATTACACACACTTGCGCCTGAGCGCCAAGCACAGG 

GTTGTTGGTCCTCATATTCACGAGGTCGCTGAGAGCACGGTGGGCTAATGTTGCCATGGG 
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TAGCATATACTACCCAAATATCTGGATAGCATATGCTATCCTAATCTATATCTGGGTAGC 

ATAGGCTATCCTAATCTATATCTGGGTAGCATATGCTATCCTAATCTATATCTGGGTAGT 

ATATGCTATCCTAATTTATATCTGGGTAGCATAGGCTATCCTAATCTATATCTGGGTAGC 

ATATGCTATCCTAATCTATATCTGGGTAGTATATGCTATCCTAATCTGTATCCGGGTAGC 

ATATGCTATCCTAATAGAGATTAGGGTAGTATATGCTATCCTAATTTATATCTGGGTAGC 

ATATACTACCCAAATATCTGGATAGCATATGCTATCCTAATCTATATCTGGGTAGCATAT 

GCTATCCTAATCTATATCTGGGTAGCATAGGCTATCCTAATCTATATCTGGGTAGCATAT 

GCTATCCTAATCTATATCTGGGTAGTATATGCTATCCTAATTTATATCTGGGTAGCATAG 

GCTATCCTAATCTATATCTGGGTAGCATATGCTATCCTAATCTATATCTGGGTAGTATAT 

GCTATCCTAATCTGTATCCGGGTAGCATATGCTATCCTCATGCATATACAGTCAGCATAT 

GATACCCAGTAGTAGAGTGGGAGTGCTATCCTTTGCATATGCCGCCACCTCCCAAGGGGG 

CGTGAATTTTCGCTGCTTGTCCrTTTCCTGCATGCATCACTAGTGAATTCGCGG 

TAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCA 

ATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAA 

GCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCCCGCCCC 

TAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAA'lM'IM'riMTATTTATG 

CAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTG 

GAGGCCTAGGCTTTTGCAAAAAGCTTGCATGCCTGCAGGTCGGCCGCCACGACCGGTGCC 

GCCACCATCCCCTGACCCACGCCCCTGACCCCTCACAAGGAGACGACCTTCCATGACCGA 

GTACAAGCCCACGGTGCGCCTCGCCACCCGCGACGACGTCCCCCGGGCCGTACGCACCCT 

CGCCGCCGCGTTCGCCGACTACCCCGCCACGCGCCACACCGTCGACCCGGACCGCCACAT 

CGAGCGGGTCACCGAGCTGCAAGAACTCTTCCTCACGCGCGTCGGGCTCGACATCGGCAA 

GGTGTGGGTCGCGGACGACGGCGCCGCGGTGGCGGTCTGGACCACGCCGGAGAGCGTCGA 

AGCGGGGGCGGTGTTCGCCGAGATCGGCCCGCGCATGGCCGAGTTGAGCGGTTCCCGGCT 

GGCCGCGCAGCAACAGATGGAAGGCCTCCTGGCGCCGCACCGGCCCAAGGAGCCCGCGTG 

GTTCCTGGCCACCGTCGGCGTCTCGCCCGACCACCAGGGCAAGGGTCTGGGCAGCGCCGT 

CGTGCTCCCCGGAGTGGAGGCGGCCGAGCGCGCCGGGGTGCCCGCCTTCCTGGAGACCTC 

CGCGCCCCGCAACCTCCCCTTCTACGAGCGGCTCGGCTTCACCGTCACCGCCGACGTCGA 

GGTGCCCGAAGGACCGCGCACCTGGTGCATGACCCGCAAGCCCGGTGCCTGACGCCCGCC 

CCACGACCCGCAGCGCCCGACCGAAAGGAGCGCACGACCCCATGGCTCCGACCGAAGCCA 

CCCGGGGCGGCCCCGCCGACCCCGCACCCGCCCCCGAGGCCCACCGACTCTAGCTAGAGG 

ATCATAATCAGCCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACAC 

CTCCCCCTGAACCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTATTGCA 

GCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATTTTTT 

TCACTGCATTCTAGTTGTGGTTTGTCCAACGCGTCGGTACCAGCTTTTGTTCCCTTTAGT 

GAGGGTTAATTTCGAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTT 

ATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTG 

CCTAATGAGTGAGCTAACTCACATTAATTGCGTTTCGCTCACTGCCCGCTTTCCAGTCGG 

GAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGC 

GTATTGGGCGCTCTTCCGC1TCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCT 

ggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggata 

acgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccg 

cgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgct 

caagtcagaggtggcgaaacccx3acaggactataaagataccaggcgtttccccctggaa 

gctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacx:tgtccgcc 

tcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgt 

AGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCG 

ccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactgg 

cagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttct 

tgaagtggtggcctaactacggctacactagaagaacagtatttggtatctgcgctctgc 

tgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccg 

ctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctc 

aagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgtt 

aagggat tttg gtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaa 

aatgaagttttaaatcaatctaaagtatatatgagtaaacttggtctgacagttaccaat 

gcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcct 

gactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctg 

caatgataccgcgagacccacg(Hx:accggctccagatttatcagcaataaaccagccag 

ccggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctatta 

attgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttg 
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CCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCG 

GTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCT 

CCITCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTA 

TGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTG 

GTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCC 

CGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTG 

GAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGA 

TGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTG 

GGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAAT 

GTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTC 

TCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCA 

CATTTCCCCGAAAAGTGCCAC 
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CTGACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGA 
CCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCITTCT^ 

CCACGTTCGCCGGCGGATCCGGCCATTAGCCATATTATTCATTGGTTATATAGCATAAAT 

CAATATTGGCTATTGGCCATTGCATACGTTGTATCX3ATATCATAATATGTACATTTATAT 

TGGCTCATGTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTA 

ATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTAC 

GGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGAC 

GTATGTTCCCATAGTAACGCCAATAGGGACTTTCCATTGACGTCAATGGGTGGAGTATTT 

ACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATATGCCAAGTACGCCCCCTAT 

TGACGTCAATGACGGTAAATGG'CCCGCCTGGCATTATGCCCAGTACATGACCTTATGGGA 

CTTTCCTACTTGGCAGTACATCTACGTATTAGTCATCGCTATTACCATGGTGATGCGGTT 

TTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACGGGGATTTCCAAGTCTCCA 

CCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGGACTTTCCAAAATG 

TCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGGGAGGTCTA 

TATAAGCAGAGCTCGTTTAGTGAACCGTATTGACGGCGTAGTACACACTATTGAATCAAA 

CAGCCGACCAATTGCACTACCATCACAATGGAGAAGCCAGTAGTAAACGTAGACGTAGAC 

CCCCAGAGTCCGTTTGTCGTGCAACTGCAAAAAAGCTTCCCGCAATTTGAGGTAGTAGCA 

CAGCAGGTCACTCCAAATGACCATGCTAATGCCAGAGCATTTTCGCATCTGGCCAGTAAA 

CTAATCGAGCTGGAGGTTCCTACCACAGCGACGATCTTGGACATAGGCAGCGCACCGGCT 

CGTAGAATGTTTTCCGAGCACCAGTATCATTGTGTCTGCCCCATGCGTAGTCCAGAAGAC 

CCGGACCGCATGATGAAATATGCCAGTAAACTGGCGGAAAAAGCGTGCAAGATTACAAAC 

AAGAACTTGCATGAGAAGATTAAGGATCTCCGGACCGTACTTGATACGCCGGATGCTGAA 

ACACCATCGCTCTGCTTTCACAACGATGTTACCTGCAACATGCGTGCCGAATATTCCGTC 

ATGCAGGACGTGTATATCAACGCTCCCGGAACTATCTATCATCAGGCTATGAAAGGCGTG 

CGGACCCTGTACTGGATTGGCTTCGACACCACCCAGTTCATGTTCTCGGCTATGGCAGGT 

TCGTACCCTGCGTACAACACCAACTGGGCCGACGAGAAAGTCCTTGAAGCGCGTAACATC 

GGACTTTGCAGCACAAAGCTGAGTGAAGGTAGGACAGGAAAATTGTCGATAATGAGGAAG 

AAGGAGTTGAAGCCCGGGTCGCGGGTTTATTTCTCCGTAGGATCGACACTTTATCCAGAA 

CACAGAGCCAGCTTGCAGAGCTGGCATCTTCCATCGGTGTTCCACTTGAATGGAAAGCAG 

TCGTACACTTGCCGCTGTGATACAGTGGTGAGTTGCGAAGGCTACGTAGTGAAGAAAATC 

ACCATCAGTCCCGGGATCACGGGAGAAACCGTGGGATACGCGGTTACACACAATAGCGAG 

GGCTTCTTGCTATGCAAAGTTACTGACACAGTAAAAGGAGAACGGGTATCGTTCCCTGTG 

TGCACGTACATCCCGGCCACCATATGCGATCAGATGACTGGTATAATGGCCACGGATATA 

TCACCTGACGATGCACAAAAACTTCTGGTTGGGCTCAACCAGCGAATTGTCATTAACGGT 

AGGACTAACAGGAACACCAACACCATGCAAAATTACCTTCTGCCGATCATAGCACAAGGG 

TTCAGCAAATGGGCTAAGGAGCGCAAGGATGATCTTGATAACGAGAAAATGCTGGGTACT 

AGAGAACGCAAGCTTACGTATGGCTGCTTGTGGGCGTTTCGCACTAAGAAAGTACATTCG 

TTTTATCGCCCACCTGGAACGCAGACCTGCGTAAAAGTCCCAGCCTCTTTTAGCGCTTTT 

CCCATGTCGTCCGTATGGACGACCTCTTTGCCCATGTCGCTGAGGCAGAAATTGAAACTG 

GCATTGCAACCAAAGAAGGAGGAAAAACTGCTGCAGGTCTCGGAGGAATTAGTCATGGAG 

GCCAAGGCTGCTTTTGAGGATGCTCAGGAGGAAGCCAGAGCGGAGAAGCTCCGAGAAGCA 

CTTCCACCATTAGTGGCAGACAAAGGCATCGAGGCAGCCGCAGAAGTTGTCTGCGAAGTG 

GAGGGGCTCCAGGCGGACATCGGAGCAGCATTAGTTGAAACCCCGCGCGGTCACGTAAGG 

ATAATACCTCAAGCAAATGACCGTATGATCGGACAGTATATCGTTGTCTCGCCAAACTCT 

GTGCTGAAGAATGCCAAACTCGCACCAGCGCACCCGCTAGCAGATCAGGTTAAGATCATA 

ACACACTCCGGAAGATCAGGAAGGTACGCGGTCGAACCATACGACGCTAAAGTACTGATG 

CCAGCAGGAGGTGCCGTACCATGGCCAGAATTCCTAGCACTGAGTGAGAGCGCCACGTTA 

GTGTACAACGAAAGAGAGTTTGTGAACCGCAAACTATACCACATTGCCATGCATGGCCCC 

GCCAAGAATACAGAAGAGGAGCAGTACAAGGTTACAAAGGCAGAGCTTGPAGAAACAGAG 

TACGTGTTTGACGTGGACAAGAAGCGTTGCGTTAAGAAGGAAGAAGCCTCAGGTCTGGTC 

CTCTCGGGAGAACTGACCAACCCTCCCTATCATGAGCTAGCTCTGGAGGGACTGAAGACC 

CGACCTGCGGTCCCGTACAAGGTCGAAACAATAGGAGTGATAGGCACACCGGGGTCGGGC 

AAGTCAGCTATTATCAAGTCAACTGTCACGGCACGAGATCTTGTTACCAGCGGAAAGAAA 

GAAAATTGTCGCGAAATTGAGGCCGACGTGCTAAGACTGAGGGGTATGCAGATTACGTCG 

AAGACAGTAGATTCGGTTATGCTCAACGGATGCCACAAAGCCGTAGAAGTGCTGTACGTT 

GACGAAGCGTTCGCGTGCCACGCAGGAGCACTACTTGCCTTGATTGCTATCGTCAGGCCC 

CGCAAGAAGGTAGTACTATGCGGAGACCCCATGCAATGCGGATTCTTCAACATGATGCAA 

CTAAAGGTACATTTCAATCACCCTGAAAAAGACATATGCACCAAGACATTCTACAAGTAT 

ATCTCCCGGCGTTGCACACAGCCAGTTACAGCTATTGTATCGACACTGCATTACGATGGA 

AAGATGAAAACCACGAACCCGTGCAAGAAGAACATTGAAATCGATATTACAGGGGCCACA 
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AAGCCGAAGCCAGGGGATATCATCCTGACATGTTTCCGCGGGTGGGTTAAGCAATTOCAA 

ATCGACTATCCCGGACATGAAGTAATGACAGCCGCGGCCTCACAAGGGCTAACCAGAAAA 

GGAGTGTATGCCGTCCGGCAAAAAGTCAATGAAAACCCACTGTACGCGATCACATCAGAG 

CATGTGAACGTGTTGCTCACCCGCACTGAGGACAGGCTAGTGTGGAAAACCTTGCAGGGC 

GACCCATGGATTAAGCAGCTCACTAACATACCTAAAGGAAACTTTCAGGCTACTATAGAG 

GACTGGGAAGCTGAACACAAGGGAATAATTGCTGCAATAAACAGCCCCACTCCCCGTGCC 

AATCCGTTCAGCTGCAAGACCAACGTTTGCTGGGCGAAAGCATTGGAACCGATACTAGCC 

ACGGCCGGTATCGTACTTACCGGTTGCCAGTGGAGCGAACTGTTCCCACAGTTTGCGGAT 

GACAAACCACATTCGGCCATTTACGCCTTAGACGTAATTTGCATTAAGTTTTTCGGCATG 

GACTTGACAAGCGGACTGTTTTCTAAACAGAGCATCCCACTAACGTACCATCCCGCCGAT 

TCAGCGAGGCCGGTAGCTCATTGGGACAACAGCCCAGGAACCCGCAAGTATGGGTACGAT 

CACGCCATTGCCGCCGAACTCTCCCGTAGATTTCCGGTGTTCCAGCTAGCTGGGAAGGGC 

ACACAACTTGATTTGCAGACGGGGAGAACCAGAGTTATCTCTGCACAGCATAACCTGGTC 

CCGGTGAACCGCAATCTTCCTCACGCCTTAGTCCCCGAGTACAAGGAGAAGCAACCCGGC 

CCGGTCGAAAAATTCTTGAACCAGTTCAAACACCACTCAGTACTTGTGGTATCAGAGGAA 

AAAATTGAAGCTCCCCGTAAGAGAATCGAATGGATCGCCCCGATTGGCATAGCCGGTGCA 

GATAAGAACTACAACCTGGCTTTCGGGTTTCCGCCGCAGGCACGGTACGACCTGGTGTTC 

ATCAACATTGGAACTAAATACAGAAACCACCACTTTCAGCAGTGCGAAGACCATGCGGCG 

ACCTTAAAAACCCTTTCGCGTTCGGCCCTGAATTGTTTAAACTCAGGAGGCACCCTCGTG 

GTGAAGTCCTATGGCTACGCCGACCGCAACAGTGAGGACGTAGTCACCGCTCTTGCCAGA 

AAGTTTGTCAGGGTGTCTGCAGCGAGACCAGATTGTGTCTCAAGCAATACAGAAATGTAC 

CTGATTTTCCGACAACTAGACAACAGCCGTACACGGCAATTCACCCCGCACCATCTGAAT 

TGCGTGATTTCGTCCGTGTATGAGGGTACAAGAGATGGAGTTGGAGCCGCGCCGTCATAC 

CGCACCAAAAGGGAGAATATTGCTGACTGTCAAGAGGAAGCAGTTGTCAACGCAGCCAAT 

CCGCTGGGTAGACCAGGCGAAGGAGTCTGCCGTGCCATCTATAAACGTTGGCCGACCAGT 

TTTACCGATTCAGCCACGGAGACAGGCACCGCAAGAATGACTGTGTGCCTAGGAAAGAAA 

GTGATCCACGCGGTCGGCCCTGATTTCCGGAAGCACCCAGAAGCAGAAGCCTTGAAATTG 

CTACAAAACGCCTACCATGCAGTGGCAGACTTAGTAAATGAACATAACATCAAGTCTGTC 

GCC ATTCC ACTGCTATCTACAGG CATTTACGC AG CCGG A A AAG ACCGCCTTG A AGTATC A 

CTTAACTGCTTGACAACCGCGCTAGACAGAACTGACGCGGACGTAACCATCTATTGCCTG 

GATAAGAAGTGGAAGGAAAGAATCGACGCGGCACTCCAACTTAAGGAGTCTGTAACAGAG 

CTGAAGGATGAAGATATGGAGATCGACGATGAGTTAGTATGGATTCATCCAGACAGTTGC 

TTGAAGGGAAGAAAGGGATTCAGTACTACAAAAGGAAAATTGTATTCGTACTTCGAAGGC 

ACCAAATTCCATCAAGCAGCAAAAGACATGGCGGAGATAAAGGTCCTGTTCCCTAATGAC 

CAGGAAAGTAATGAACAACTGTGTGCCTACATATTGGGTGAGACCATGGAAGCAATCCGC 

GAAAAGTGCCCGGTCGACCATAACCCGTCGTCTAGCCCGCXICAAAACGTTGCCGTGCCTT 

TGCATGTATGCCATGACGCCAGAAAGGGTCCACAGACTTAGAAGCAATAACGTCAAAGAA 

GTTACAGTATGCTCCTCCACCCCCCTTCCTAAGCACAAAATTAAGAATGTTCAGAAGGTT 

CAGTGCACGAAAGTAGTCCTGTTTAATCCGCACACTCCCGCATTCGTTCCCGCCCGTAAG 

TACATAGAAGTGCCAGAACAGCCTACCGCTCCTCCTGCACAGGCCGAGGAGGCCCCCGAA 

GTTGTAGCGACACCGTCACCATCTACAGCTGATAACACCTCGCTTGATGTCACAGACATC 

TCACTGGATATGGATGACAGTAGCGAAGGCTCACTTTTTTCGAGCTTTAGCGGATCGGAC 

AACTCTATTACTAGTATGGACAGTTGGTCGTCAGGACCTAGTTCACTAGAGATAGTAGAC 

CGAAGGCAGGTGGTGGTGGCTGACGTTCATGCCGTCCAAGAGCCTGCCCCTATTCCACCG 

CCAAGGCTAAAGAAGATGGCCCGCCTGGCAGCGGCAAGAAAAGAGCCCACTCCACCGGCA 

AGCAATAGCTCTGAGTCCCTCCACCTCTCITTTGGTGGGGTATCCATGTCCCT 

ATTTTCGACGGAGAGACGGCCCGCCAGGCAGCGGTACAACCCCTGGCAACAGGCCCCACG 

GATGTGCCTATGTCTTTCGGATCGTTTTCCGACGGAGAGATTGATGAGCTGAGCCGCAGA 

GTAACTGAGTCCGAACCCGTCCTGTTTGGATCATTTGAACCGGGCGAAGTGAACTCAATT 

ATATCGTCCCGATCAGCCGTATCTTTTCCTCTACGCAAGCAGAGACGTAGACGCAGGAGC 

AGGAGGACTGAATACTGACTAACCGGGGTAGGTGGGTACATATTTTCGACGGACACAGGC 

CCTGGGCACTTGCAAAAGAAGTCCGTTCTGCAGAACCAGCTTACAGAACCGACCTTGGAG 

CGCAATGTCCTGGAAAGAATTCATGCCCCGGTGCTCGACACGTCGAAAGAGGAACAACTC 

AAACTCAGGTACCAGATGATGCCCACCGAAGCCAACAAAAGTAGGTACCAGTCTCGTAAA 

GTAGAAAATCAGAAAGCCATAACCACTGAGCGACTACTGTCAGGACTACGACTGTATAAC 

TCTGCCACAGATCAGCCAGAATGCTATAAGATCACCTATCCGAAACCATTGTACTCCAGT 

AGCGTACCGGCGAACTACTCCGATCCACAGTTCGCTGTAGCTGTCTGTAACAACTATCTG 

CATGAGAACTATCCGACAGTAGCATCTTATCAGATTACTGACGAGTACGATGCTTACTTG 

GATATGGTAGACGAGACAGTCGCATGCCTGGATACTGCAACCTTCTGCCCCGCTAAGCTT 

AGAAGTTACCCGAAAAAACATGAGTATAGAGCCCCGAATATCCGCAGTGCGGTTCCATCA 
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GCGATGCAGAACACGCTACAAAATGTGCTCATTGCCGCAACTAAAAGAAATTGCAACGTC 

ACGCAGATGCGTGAACTGCCAACACTGGACTCAGCGACATTCAATGTCGAATGCTTTCGA 

AAATATGCATGTAATGACGAGTATTGGGAGGAGTTCGCTCGGAAGCCAATTAGGATTACC 

ACTGAGTTTGTCACCGCATATGTAGCTAGACTGAAAGGCCCTAAGGCCGCCGCACTATTT 

GCAAAGACGTATAATTTGGTCCCATTGCAAGAAGTGCCTATGGATAGATTCGTCATGGAC 

ATGAAAAGAGACGTGAAAGTTACACCAGGCACGAAACACACAGAAGAAAGACCGAAAGTA 

CAAGTGATACAAGCCGCAGAACCCCTGGCGACTGCTTACTTATGCGGGATTCACCGGGAA 

TTAGTGCGTAGGCTTACGGCCGTCTTGCTTCCAAACATTCACACGCnTTTT 

GCGGAGGATTTTGATGCAATCATAGCAGAACACTTCAAGCAAGGCGACCCGGTACTGGAG 

ACGGATATCGCATCATTCGACAAAAGCCAAGACGACGCTATGGCGTTAACCGGTCTGATG 

ATCTTGG AGG ACCTGGGTGTG G ATC A ACCACTACTCGACTTG ATCG AGTG CGCCTTTG G A 

GAAATATCATCCACCCATCTACCTACGGGTACTCGTTTTAAATTCGGGGCGATGATGAAA 

TCCGGAATGTTCCTCACACTTTTTGTCAACACAGTTTTGAATGTCGTTATCGCCAGCAGA 

GTACTAGAAGAGCGGCTTAAAACGTCCAGATGTGCAGCGTTCATTGGCGACGACAACATC 

ATACATGGAGTAGTATCTGACAAAGAAATGGCTGAGAGGTGCGCCACCTGGCTCAACATG 

GAG GTTA AGATC ATCG ACGC AGTCATCGGTG AGAG ACCACCTTACTTCTG.CGG CG G ATTT 

ATCTTGCAAGATTCGGTTACTTCCACAGCGTGCCGCGTGGCGGATCCCCTGAAAAGGCTG 

TTTAAGTTGGGTAAACCGCTCCCAGCCGACGACGAGCAAGACGAAGACAGAAGACGCGCT 

CTGCTAGATGAAACAAAGGCGTGGTTTAGAGTAGGTATAACAGGCACTTTAGCAG TGGCC 

GTGACGACCCGGTATGAGGTAGACAATATTACACCTGTCCTACTGGCATTGAGAACTTTT 

GCCCAGAGCAAAAGAGCATTCCAAGCCATCAGAGGGGAAATAAAGCATCTCTACGGTGGT 

CCTAAATAGTCAGCATAGTACATTTCATCTGACTAATACCACAACACCACCACCTCTAGA 

TTAATTAACTCGAGGCGCGCCGGGCCCAGAGGATCTTTGTGAAGGAACCTTACTTCTGTG 

GTGTGACATAATTGGACAAACTACCTACAGAGATTTAAAGCTCTA AGGTA AATATAAAAT 

TTTTAAGTGTATAATGTGTTAAACTACTGATTCTAATTC 

CCTATGGAACTGATGAATGGGAGCAGTGGTGGAATGCCTTTAATGAGGAAAACCTGTTTT 

GCTCAGAAGAAATGCCATCTAGTGATGATGAGGCTACTGCTGACTCTCAACATTCTACTC 

CTCCAAAAAAGAAGAGAAAGGTAGAAGACCCCAAGGACTTTCCTTCAGAATTGCTAAGTT 

TTTTGAGTCATGCTGTGTTTAGTAATAGAACTCTTGCTTGCTTTGCTATTTACACCACAA 

AGGAAAAAGCTGCACTGCTATACAAGAAAATTATGGAAAAATATTTGATGTAGCGGCCCA 

ATGATCCGACCAGCAAAACTCGATGTACTTCCGAGGAACTGATGTGCATAATGCATCAGG 

CTGGTACATTAGATCCCCGCTTACCGCGGGCAATATAGCAACACTAAAAACTCGATGTAC 

TTCCGAGGAAGCGCAGTGCATAATGCTGCGCAGTGTTGCCACATAACCACTATATTAACC 

ATTTATCTAGCGGACGCCAAAAACTCAATGTATTTCTGAGGAAGCGTGGTGCATAATGCC 

ACGCAGCGTCTGCATAACTTTTATTAT^ 

CATTTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGGAATTCCCAACTTGTTTAT 

TGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGCATT 

TTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACrrCATCAATGTATCTTATCATGTCTG 

GATCCGTCGAGACGCGTGCGGCCGCGGGAATTCGATGTGGCTTTCATCCTGGAGCAGACT 

TTGGAGTCTGTGGACTGCAACACAACATTGCCTTTATGTGTAACTCTTGGCTGAAGCTCT 

TACACCAATGCTGGGGGACATGTACCTCCCAGGGGCCCAGGAAGACTACGGGAGGCTACA 

CCAACGTCAATCAGAGGGGCCTGTGTAGCTACCGATAAGCGGACCCTCAAGAGGGCATTA 

GCAATAGTGTTTATAAGGCCCCCTTGTTAACCCTAAACGGGTAGCATATGCTTCCCGGGT 

AGTAGTATATACTATCCAGACTAACCCTAATTCAATAGCATATGTTACCCAACGGGAAGC 

ATATGCTATCGAATTAGGGTTAGTAAAAGGGTCCTAAGGAACAGCGATATCTCCCACCCC 

ATGAGCTGTCACGGTTTTATTTACATGGGGTCAGGATTCCACGAGGGTAGTGAACCATTT 

TAGTCACAAGGGCAGTGGCTGAAGATCAAGGAGCGGGCAGTGAACTCTCCTGAATCTTCG 

CCTGCTTCTTCATTCTCCTTCGTTTAGCTAATAGAATAACTGCTGAGTTGTC 

GGTGTATGTGAGGTGCTCGAAAACAAGGTTTCAGGTGACGCCCCCAGAATAAAATTTGGA 

CGGGGGGTTCAGTGGTGGCATTGTGCTATGACACCAATATAACCCTCACAAACCCCTTGG 

GCAATAAATACTAGTGTAGGAATGAAACATTCTGAATATCTTTAACAATAGAAATCCATG 

GGGTGGGGACAAGCCGTAAAGACTGGATGTCCATCTCACACGAATTTATGGCTATGGGCA 

ACACATAATCCTAGTGCAATATGATACTGGGGTTATTAAGATGTGTCCCAGGCAGGGACC 

AAGACAGGTGAACCATGTTGTTACACTCTATTTGTAACAAGGGGAAAGAGAGTGGACGCC 

GACAGCAGCGGACTCCACTGGTTGTCTCTAACACCCCCGAAAATTAAACGGGGCTCCACG 

CCAATGGGGCCCATAAACAAAGACAAGTGGCCACTC 1 1 i 1 i ' I ' 1 ' i GAA ATTGTGGAGTGGG 

GGCACGCGTCAGCCCCCACACGCCGCCCTGCGGTTTTGGACTGTAAAATAAGGGTGTAAT 

AACTTGGCTGATTGTAACCCCGCTAACCACTGCGGTCAAACCACTTGCCCACAAAACCAC 

TAATGGCACCCCGGGGAATACCTGCATAAGTAGGTGGGCGGGCCAAGATAGGGGCGCGAT 

TGCTGCGATCTGGAGGACAAATTACACACACTTGCGCCTGAGCGCCAAGCACAGGGTTGT 



pCylTS OPE 



Fig. 9C 



TGGTCCTCATATTCACGAGGTCGCTGAGAGCACGGTGGGCTAATGTTGCCATGGGTAGCA 

TATACTACCC A A ATATCTGG ATAG CATATGCTATCCTA ATCTATATCTGG GTAGC ATAGG 

CTATCCTAATCTATATCTGGGTAGCATATGCTATCCTAATCTATATCTGGGTAGTATATG 

CTATCCTAATTTATATCTGGGTAGCATAGGCTATCCTAATCTATATCTGGGTAGCATATG 

CTATCCTAATCTATATCTGGGTAGTATATGCTATCCTAATCTGTATCCGGGTAGCATATG 

CTATCCTAATAGAGATTAGGGTAGTATATGCTATCCTAATTTATATCTGGGTAGCATATA 

CTACCCAAATATCTGGATAGCATATGCTATCCTAATCTATATCTGGGTAGCATATGCTAT 

CCTAATCTATATCTGGGTAGCATAGGCTATCCTAATCTATATCTGGGTAGCATATGCTAT 

CCTAATCTATATCTGGGTAGTATATGCTATCCTAATTTATATCTGGGTAGCATAGGCTAT 

CCTAATCTATATCTGGGTAGCATATGCTATCCTAATCTATATCTGGGTAGTATATGCTAT 

CCTAATCTGTATCCGGGTAGCATATGCTATCCTCATGCATATACAGTCAGCATATGATAC 

CCAGTAGTAGAGTGGGAGTGCTATCCTTTGCATATGCCGCCACCTCCCAAGGGGGCGTGA 

ATTTTCGCTGCTTGTCCTTTTCCTGCATGCTGGTTGCTCCCATTCT^ 

GAGGCCAGGCTAAAGCCGTCGCATGTCTGATTGCTCACCAGGTAAATGTCGCTAATGTTT 

TCCAACGCGAGAAGGTGTTGAGCGCGGAGCTGAGTGACGTGACAACATGGGTATGCCCAA 

TTGCCCCATGTTGGGAGGACGAAAATGGTGACAAGACAGATGGCCAGAAATACACCAACA 

GCACGCATGATGTCTACTGGGGATTTATTCTTTAGTGCGGGGGAATACACGGCTTTTAAT 

ACGATTGAGGGCGTCTCCTAACAAGTTACATCACTCCTGCCCTTCCTCACCCTCATCTCC 

ATCACCTCCTTCATCTCCGTCATCTCCGTCATCACCCTCCGCGGCAGCCCCTTCCACCAT 

AGGTGGAAACCAGGGAGGCAAATCTACTCCATCGTCAAAGCTGCACACAGTCACCCTGAT 

ATTGCAGGTAGGAGCGGGCTTTGTCATAACAAGGTCCTTAATCGCATCCTTCAAAACCTC 

AGCAAATATATGAGTTTGTAAAAAGACCATGAAATAACAGACAATGGACTCCCTTAGCGG 

GCCAGGTTGTGGGCCGGGTCCAGGGGCCATTCCAAAGGGGAGACGACTCAATGGTGTAAG 

ACGACATTGTGGAATAGCAAGGGCAGTTCCTCGCCTTAGGTTGTAAAGGGAGGTCTTACT 

ACCTCCATATACGAACACACCGGCGACCCAAGTTCCTTCGTCGGTAGTCCTTTCTACGTG 

ACTCCTAGCCAGGAGAGCTCTTAAACCTTCTGCAATGTTCTCAAATTTCGGGTTGGAACC 

TCCTTGACCACGATGCTTTCCAAACCArc 

ACCCCGGGGTCCAGTGCTTGGGCCTTCTCCTGGGTCATCTGCGGGGCCCTGCTCTATCGC 

TCCCGGGGGCACGTCAGGCTCACCATCTGGGCCACCTTCTTGGTGGTATTCAAAATAATC 

GGCTTCCCCTACAGGGTGGAAAAATGGCCTTCTACCTGGAGGGGGCCTGCGCGGTGGAGA 

CCCGGATGATGATGACTGACTACTGGGACTCCTGGGCCTCTTTTCTCCACGTCCACGACC 

TCTCCCCCTGGCTCTTTCACGACTTCCCCCCCTGGCTCTTTCACGTCCTCTACCCCGGCG 

GCCTCCACTACCTCCTCGACCCCGGCCTCCACTACCTCCTCGACCCCGGCCTCCACTGCC 

TCCTCGACCCCGGCCTCCACCTCCTGCTCCTGCCCCTCCTGCTCCTGCCCCTCCTCCTGC 

TCCTGCCCCTCCTGCCCCTCCTGCTCCTGCCCCTCCTGCCCCTCCTGCTCCTGCCCCTCC 

TGCCCCTCCTGCTCCTGCCCCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCCTGCCCCTCC 

TCCTGCTCCTGCCCCTCCTGCCCCTC 

CCCTCCTGCCCCTCCTGCTCCTGCCCCTCCTGCTCCTGCCCCTX:CTGCTCCTGCCCCTCC 

TGCTCCTGCCCCTCCTGCCCCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCCTGCTCCTGC 

CCCTCCTGCCCCTCCTGCCCCTCCTGCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCCTGC 

CCCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCC 

TCCTGCTCCTGCCCCTCCTGCCCCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCCTGCCCC 

TCCTCCTGCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCCTGCCCCTCCTGCCCCTCCTCC 

TGCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCCTGCCCCTCCTGCCCCTCCTGCCCCTCC 

TCCTGCTCCTGCCCCTCCTCCTGCTCCTGCCCCTCCTGCTCCTGCCCCTCCCGCTCCTGC 

TCCTGCTCCTGTTCCACCGTGGGTCCCTTTGCAGCCAATGCAACTTGGACGTTTTTGGGG 

TCTCCGGACACCATCTCTATGTCTTGGCCCTGATCCTGAGCCGCCCGGGGCTCCTGGTCT 

TCCGCCTCCTCGTCCTCGTCCTCTTCCCCGTCCTCGTCCATGGTTATCACCCCCTCTTCT 

TTGAGGTCCACTGCCGCCGGAGCCTTCTGGTCCAGATGTGTCTCCCTTCTCTCCTAGGCC 

ATTTCCAGGTCCTGTACCTGGCCCCTCGTCAGACATGATTCACACTAAAAGAGATCAATA 

GACATCTTTATTAGACGACGCTCAGTGAATACAGGGAGTGCAGACTCCTGCCCCCTCCAA 

CAGCCCCCCCACCCTCATCCCCTTCATGGTCGCTGTCAGACAGATCCAGGTCTGAAAATT 

CCCCATCCTCCGAACCATCCTCGTCCTCATCACCAATTACTCGCAGCCCGGAAAACTCCC 

GCTGAACATCCTCAAGATTTGCGTCCTGAGCCTCAAGCCAGGCCTCAAATTCCTCGTCCC 

CCTTTTTGCTGGACGGTAGGGATGGGGATTCTCGGGACCCCTCCTCTTCCTCTTCAAGGT 

CACCAGACAGAGATGCTACTGGGGCAACGGAAGAAAAGCTGGGTGCGGCCTGTGAGGATC 

AGCTTATATCACTAGTGAATTCGCGGCCGCGTTAGGGTGTGGAAAGTCCCCAGGCTCCCC 

AGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTC 

CCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAT 

AGTCCCGCCCCrAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCC 
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GCCCCATGGCTGACTAAriMil-rriATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGA 

GCTATTCCAGAAGTAGTGAGGAGGCl"l"l"nTGGAGGCCTAGGCTTTTGCAAAAAGCTTGC 

ATGCCTGCAGGTCGGCCGCCACGACCGGTGCCGCCACCATCCCCTGACCCACGCCCCTGA 

CCCCTCACAAGGAGACGACCTTCCATGACCGAGTACAAGCCCACGGTGCGCCTCGCCACC 

CGCGACGACGTCCCCCGGGCCGTACGCACCCTCGCCGCCGCGTTCGCCGACTACCCCGCC 

ACGCGCCACACCGTCGACCCGGACCGCCACATCGAGCGGGTCACCGAGCTGCAAGAACTC 

TTCCTCACGCGCGTCGGGCTCGACATCGGCAAGGTGTGGGTCGCGGACGACGGCGCCGCG 

GTGGCGGTCTGGACCACGCCGGAGAGCGTCGAAGCGGGGGCGGTGTTCGCCGAGATCGGC 

CCGCGCATGGCCGAGTTGAGCGGTTCCCGGCTGGCCGCGCAGCAACAGATGGAAGGCCTC 

CTGGCGCCGCACCGGCCCAAGGAGCCCGCGTGGTTCCTGGCCACCGTCGGCGTCTCGCCC 

GACCACCAGGGCAAGGGTCTGGGCAGCGCCGTCGTGCTCCCCGGAGTGGAGGCGGCCGAG 

CGCGCCGGGGTGCCCGCCTTCCTGGAGACCTCCGCGCCCCGCAACCTCCCCTTCTACGAG 

CGGCTCGGCTTCACCGTCACCGCCGACGTCGAGGTGCCCGAAGGACCGCGCACCTGGTGC 

ATGACCCX3CAAGCCCGGTGCCTGACGCCCGCCCCACGACCCGCAGCGCCCGACCGAAAGG 

AGCGCACGACCCCATGGCTCCGACCGAAGCCACCCGGGGCGGCCCCGCCGACCCCGCACC 

CGCCCCCGAGGCCCACCGACTCTAGCTAGAGGATCATAATCAGCCATACCACATTTGTAG 

AGGTTTTACTTGCTTTAAAAAACCTCCCACACCTCCCCCTGAACCTGAAACATAAAATGA 

ATGCAATTGTTGTTGTTAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATA 

GCATCACAAATTTCACAAATAAAGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCA 

ACGCGTCGGTACCAGCTTTTGTTCCCTTTAGTGAGGGTTAATTTCGAGCTTGGCGTAATC 

ATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACG 

AGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAAT 

TGCGTTTCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATG 

AATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCT 

CACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGC 

GGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGG 

CCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCG 

CCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGG 

ACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGAC 

CCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCA 

TAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGT 

GCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTC 

CAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAG 

AGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACAC 

TAGAAGAACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGT 

TGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAA 

GCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGG 

GTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAA 

AAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTAT 

ATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGC 

GATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGAT 

ACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACC 

GGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCC 

TGCAACnTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAG 

TTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACG 

CTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATG 

ATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAG 

TAAGTTGGCCGCAGTGTTATCACrCATGGTTATGGCAGCACTGCATAATTCTCrrTACT 

CATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGA 

ATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCC 

ACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTC 

AAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATC 

TTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGC 

CGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCA 

ATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTAT 

TTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCAC 
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